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SCHIZOPHRENIA ASSOCIATED GENE, PROTEINS AND BIALLELIC 

MARKERS 

FIELD OF THE INVENTION 

The invention concerns the human g35030 gene, and g35030 polynucleotides, 
polypeptides, as well as biallelic markers localized in the g35030 gene and in the human 
chromosome 13q31-q33 region. The invention also concerns the association established 
between schizophrenia and bipolar disorder and the biallelic markers and the g35030 gene and 
nucleotide sequences. The invention provides means to identify compounds useful in the 
treatment of schizophrenia, bipolar disorder and related diseases, means to determine the 
predisposition of individuals to said disease as well as means for the disease diagnosis and 
prognosis. 

BACKGROUND OF THE INVENTION 

Advances in the technological armamentarium available to basic and clinical 
investigators have enabled increasingly sophisticated studies of brain and nervous system 
function in health and disease. Numerous hypotheses both neurobiological and 
pharmacological have been advanced with respect to the neurochemical and genetic 
mechanisms involved in central nervous system (CNS) disorders, including psychiatric 
disorders and neurodegenerative diseases. However, CNS disorders have complex and poorly 
understood etiologies, as well as symptoms that are overlapping, poorly characterized, and 
difficult to measure. As a result future treatment regimes and drug development efforts will be 
required to be more sophisticated and focused on multigenic causes, and will need new assays 
to segment disease populations, and provide more accurate diagnostic and prognostic 
information on patients suffering from CNS disorders. 

Genetic Analysis of Complex Traits 

Until recently, the identification of genes linked with detectable traits has relied mainly 
on a statistical approach called linkage analysis. Linkage analysis is based upon establishing a 
correlation between the transmission of genetic markers and that of a specific trait throughout 
generations within a family. Linkage analysis involves the study of families with multiple 
affected individuals and is useful in the detection of inherited-traits, which are caused by a 
single gene, or possibly a very small number of genes. But, linkage studies have proven 
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difficult when applied to complex genetic traits. Most traits of medical relevance do not follow 
simple Mendel ian monogenic inheritance. However, complex diseases often aggregate in 
families, which suggests that there is a genetic component to be found. Such complex traits are 
often due to the combined action of multiple genes as well as environmental factors. Such 
complex trait, include susceptibilities to heart disease, hypertension, diabetes, cancer and 
inflammatory diseases. Drug efficacy, response and tolerance/toxicity can also be considered as 
multifactoral traits involving a genetic component in the same way as complex diseases. 
Linkage analysis cannot be applied to the study of such traits for which no large informative 
families are available. Moreover, because of their low penetrance, such complex traits do not 
segregate in a clear-cut Mendelian manner as they are passed from one generation to the next. 
Attempts to map such diseases have been plagued by inconclusive results, demonstrating the 
need for more sophisticated genetic tools. 

Knowledge of genetic variation in the neuronal and endocrine systems is important for 
understanding why some people are more susceptible to disease or respond differently to 
treatments. Ways to identify genetic polymorphism and to analyze how they impact and predict 
disease susceptibility and response to treatment are needed. 

Although the genes involved in the neuronal and endocrine systems represent major 
drug targets and are of high relevance to pharmaceutical research, we still have scant knowledge 
concerning the extent and nature of, sequence variation in these genes and their regulatory 
elements. In the case where polymorphisms have been identified the relevance of the variation 
is rarely understood. While polymorphisms hold promise for use as genetic markers in 
determining which genes contribute to multigenic or quantitative traits, suitable markers and 
suitable methods for exploiting those markers have not been found and brought to bare on the 
genes related to disorders of the brain and nervous system. 

The basis for accomplishment of these goals is to use genetic association analysis to 
detect markers that predict susceptibility for these traits. Recently, advances in the fields of 
genetics and molecular biology have allowed identification of forms, or alleles, of human genes 
that lead to diseases. Most of the genetic variations responsible for human diseases identified so 
far, belong to the class of single gene disorders. As this name implies, the development of 
single gene disorders is determined, or largely influenced, by the alleles of a single gene. The 
alleles that cause these disorders are, in general, highly deleterious (and highly penetrant) to 
individuals who carry them. Therefore, these alleles and their associated diseases, with some 
exceptions, tend to be very rare in the human population. In contrast, most common diseases 
and non-disease traits, such as a physiological response to a pharmaceutical agent, can be 
viewed as the result of many complex factors. These can include environmental exposures 
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(toxins, allergens, infectious agents, climate, and trauma) as well as multiple genetic factors. 

Association studies seek to analyze the distributions of chromosomes that have 
occurred in populations of unrelated (at least not directly related) individuals. An assumption in 
this type of study is that genetic alleles that result in susceptibility for a common trait arose by 
ancient mutational events on chromosomes that have been passed down through many 
generations in the population. These alleles can become common throughout the population in 
part because the trait they influence, if deleterious, is only expressed in a fraction of those 
individuals who carry them. Identification of these "ancestral" chromosomes is made difficult 
by the fact that genetic markers are likely to have become separated from the trait susceptibility 
allele through the process of recombination, except in regions of DNA which immediately 
surround the allele. The identities of genetic markers contained within the fragments of DNA 
surrounding a susceptibility allele will be the same as those from the ancestral chromosome on 
which the allele arose. Therefore, individuals from the population who express a complex trait 
might be expected to carry the same set of genetic markers in the vicinity of a susceptibility 
allele more often than those who do not express the trait; that is these markers will show an 
association with the trait. 
Schizophrenia 

Schizophrenia is one of the most severe and debilitating of the major psychiatric 
diseases. It usually starts in late adolescence or early adult life and often becomes chronic and 
disabling. Men and women are at equal risk of developing this illness; however, most males 
become ill between 16 and 25 years old, while females develop symptoms between 25 and 30. 
People with schizophrenia often experience both "positive" symptoms (e.g., delusions, 
hallucinations, disorganized thinking, and agitation) and "negative" symptoms (e.g., lack of 
drive or initiative, social withdrawal, apathy, and emotional unresponsiveness). 

Schizophrenia affects 1% of the world population. There are an estimated 45 million 
people with schizophrenia in the world, with more than 33 million of them in the developing 
countries. This disease places a heavy burden on the patient's family and relatives, both in 
terms of the direct and indirect costs involved and the social stigma associated with the illness, 
sometimes over generations. Such stigma often leads to isolation and neglect. 

Moreover, schizophrenia accounts for one fourth of all mental health costs and takes up 
one in three psychiatric hospital beds. Most schizophrenia patients are never able to work. The 
cost of schizophrenia to society is enormous. In the United States, for example, the direct cost 
of treatment of schizophrenia has been estimated to be close to 0.5% of the gross national 
product. Standardized mortality ratios (SMRs) for schizophrenic patients are estimated to be 
two to four times higher than the general population, and their life expectancy overall is 20 % 
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shorter than for the general population. The most common cause of death among schizophrenic 
patients is suicide (in 10 % of patients) which represents a 20 times higher risk than for the 
general population. Deaths from heart disease and from diseases of the respiratory and 
digestive system are also increased among schizophrenic patients. 
Bipolar Disorder 

Bipolar disorders are relatively common disorders with severe and potentially disabling 
effects. In addition to the severe effects on patients' social development, suicide completion 
rates among bipolar patients are reported to be about 1 5%. 

Bipolar disorders are characterized by phases of excitement and often including 
depression; the excitement phases, referred to as mania or hypomania, and depression can 
alternate or occur in various admixtures, and can occur to different degrees of severity and over 
varying time periods. Because bipolar disorders can exist in different forms and display 
different symptoms, the classification of bipolar disorder has been the subject of extensive 
studies resulting in the definition of bipolar disorder subtypes and widening of the overall 
concept to include patients previously thought to be suffering from different disorders. Bipolar 
disorders often share certain clinical signs, symptoms, treatments and neurobiologies features 
with psychotic illnesses in general and therefore present a challenge to the psychiatrist to make 
an accurate diagnosis. Furthermore, because the course of bipolar disorders and various mood 
and psychotic disorders can differ greatly, it is critical to characterize the illness as early as 
possible in order to offer means to manage the illness over a long term. 

Bipolar disorders appear in about 1.3% of the population and have been reported to 
constitute about half of the mood disorders seen in a psychiatric clinic. Bipolar disorders have 
been found to vary with gender depending of the type of disorder; for example, bipolar disorder 
I is found equally among men and women, while bipolar disorder II is reportedly more common 
in women. The age of onset of bipolar disorders is typically in the teenage years and diagnosis 
is typically made in the patient's early twenties. Bipolar disorders also occur among the elderly, 
generally as a result of a medical or neurological disorder. 

The costs of bipolar disorders to society are enormous. The mania associated with the 
disease impairs performance and causes psychosis, and often results in hospitalization. This 
disease places a heavy burden on the patient's family and relatives, both in terms of the direct 
and indirect costs involved and the social stigma associated with the illness, sometimes over 
generations. Such stigma often leads to isolation and neglect. Furthermore, the earlier the onset, 
the more severe are the effects of interrupted education and social development. 

The DSM-IV classification of bipolar disorder distinguishes among four types of 
disorders based on the degree and duration of mania or hypomania as well as two types of 
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disorders which are evident typically with medical conditions or their treatments, or to 
substance abuse. Mania is recognized by elevated, expansive or irritable mood as well as by 
distractability, impulsive behavior, increased activity, grandiosity, elation, racing thoughts, and 
pressured speech. Of the four types of bipolar disorder characterized by the particular degree 
and duration of mania , DSM-IV includes: 

- bipolar disorder I, including patients displaying mania for at least one week; 

- bipolar disorder II, including patients displaying hypomania for at least 4 days, 
characterized by milder symptoms of excitement than mania, who have not previously 
displayed mania, and have previously suffered from episodes of major depression; 

- bipolar disorder not otherwise specified (NOS), including patients otherwise 
displaying features of bipolar disorder II but not meeting the 4 day duration for the excitement 
phase, or who display hypomania without an episode of major depression; and 

- cyclothymia, including patients who show numerous manic and depressive symptoms 
that do not meet the criteria for hypomania or major depression, but which are displayed for 
over two years without a symptom-free interval of more than two months. 

The remaining two types of bipolar disorder as classified in DSM-VI are disorders 
evident or caused by various medical disorder and their treatments, and disorders involving or 
related to substance abuse. Medical disorders which can cause bipolar disorders typically 
include endocrine disorders and cerebrovascular injuries, and medical treatments causing 
bipolar disorder are known to include glucocorticoids and the abuse of stimulants. The disorder 
associated with the use or abuse of a substance is referred to as "substance induced mood 
disorder with manic or mixed features". 

Diagnosis of bipolar disorder can be very challenging. One particularly troublesome 
difficulty is that some patients exihibit mixed states, simultaneously manic and dysphoric or 
depressive, but do not fall into the DSM-IV classification because not all required criteria for 
mania and major depression are met daily for at least one week. Other difficulties include 
classification of patients in the DSM-IV groups based on duration of phase since patients often 
cycle between excited and depressive episodes at different rates, in particular, it is reported that 
the use of antidepressants may alter the course of the disease for the worse by causing "rapid- 
cycling". Also making diagnosis more difficult is the fact that bipolar patients, particularly at 
what is known as Stage III mania, share symptoms of disorganized thinking and behavior with 
bipolar disorder patients. Furthermore, psychiatrists must distinguish between agitated 
depression and mixed mania; it is common that patients with major depression (14 days or 
more) exhibit agitiation, resulting in bipolar-like features. A yet further complicating factor is 
that bipolar patients have an exceptionally high rate of substance, particularly alcohol abuse. 
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While the prevalence of mania in alcoholic patients is low, it is well known that substance 
abusers can show excited symptoms. Difficulties therefore result for the diagnosis of bipolar 
patients with substance abuse. 
Treatment 

As there are currently no cures for bipolar disorder or schizophrenia, the objective of 
treatment is to reduce the severity of the symptoms, if possible to the point of remission. Due to 
the similarities in symptoms, schizophrenia and bipolar disorder are often treated with some of 
the same medicaments. Both diseases are often treated with antipsychotics and neuroleptics. 

For schizophrenia, for example, antipsychotic medications are the most common and 
most valuable treatments. There are four main classes of antipsychotic drugs which are 
commonly prescribed for schizophrenia. The first, neuroleptics, exemplified by chlorpromazine 
(Thorazine), has revolutionized the treatment of schizophrenic patients by reducing positive 
(psychotic) symptoms and preventing their recurrence. Patients receiving chlorpromazine have 
been able to leave mental hospitals and live in community programs or their own homes. But 
these drugs are far from ideal. Some 20% to 30% of patients do not respond to them at all, and 
others eventually relapse. These drugs were named neuroleptics because they produce serious 
neurological side effects, including rigidity and tremors in the arms and legs, muscle spasms, 
abnormal body movements, and akathisia (restless pacing and fidgeting). These side effects are 
so troublesome that many patients simply refuse to take the drugs. Besides, neuroleptics do not 
improve the so-called negative symptoms of schizophrenia and the side effects may even 
exacerbate these symptoms. Thus, despite the clear beneficial effects of neuroleptics, even 
some patients who have a good short-term response will ultimately deteriorate in overall 
functioning. 

The well known deficiencies in the standard neuroleptics have stimulated a search for 
new treatments and have led to a new class of drugs termed atypical neuroleptics. The first 
atypical neuroleptic, Clozapine, is effective for about one third of patients who do not respond 
to standard neuroleptics. It seems to reduce negative as well as positive symptoms, or at least 
exacerbates negative symptoms less than standard neuroleptics do. Moreover, it has beneficial 
effects on overall functioning and may reduce the chance of suicide in schizophrenic patients. It 
does not produce the troubling neurological symptoms of the standard neuroleptics, or raise 
blood levels of the hormone prolactin, excess of which may cause menstrual irregularities and 
infertility in women, impotence or breast enlargement in men. Many patients who cannot 
tolerate standard neuroleptics have been able to take clozapine. However, clozapine has serious 
limitations. It was originally withdrawn from the market because it can cause agranulocytosis, a 
potentially lethal inability to produce white blood cells. Agranulocytosis remains a threat that 
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requires careful monitoring and periodic blood tests. Clozapine can also cause seizures and 
other disturbing side effects (e.g., drowsiness, lowered blood pressure, drooling, bed-wetting, 
and weight gain). Thus it is usually taken only by patients who do not respond to other drugs. 

Researchers have developed a third class of antipsychotic drugs that have the virtues of 
clozapine without its defects. One of these drugs is risperidone (Risperdal). Early studies 
suggest that it is as effective as standard neuroleptic drugs for positive symptoms and may be 
somewhat more effective for negative symptoms. It produces more neurological side effects 
than clozapine but fewer than standard neuroleptics. However, it raises prolactin levels. 
Risperidone is now prescribed for a broad range of psychotic patients, and many clinicians seem 
to use it before clozapine for patients who do not respond to standard drugs, because they regard 
it as safer. Another new drug is Olanzapine (Zyprexa) which is at least as effective as standard 
drugs for positive symptoms and more effective for negative symptoms. It has few neurological 
side effects at ordinary clinical doses, and it does not significantly raise prolactin levels. 
Although it does not produce most of clozapine's most troubling side effects, including 
agranulocytosis, some patients taking olanzapine may become sedated or dizzy, develop dry 
mouth, or gain weight. In rare cases, liver function tests become transiently abnormal. 

Outcome studies in schizophrenia are usually based on hospital treatment studies and 
may not be representative of the population of schizophrenia patients. At the extremes of 
outcome, 20 % of patients seem to recover completely after one episode of psychosis, whereas 
14-19% of patients develop a chronic unremitting psychosis and never fully recover. In general, 
clinical outcome at five years seems to follow the rule of thirds: with about 35 % of patients in 
the poor outcome category; 36 % in the good outcome category, and the remainder with 
intermediate outcome. Prognosis in schizophrenia does not seem to worsen after five years. 

Whatever the reasons, there is increasing evidence that leaving schizophrenia untreated 
for long periods early in course of the illness may negatively affect the outcome. However, the 
use of drugs is often delayed for patients experiencing a first episode of the illness. The patients 
may not realize that they are ill, or they may be afraid to seek help; family members sometimes 
hope the problem will simply disappear or cannot persuade the patient to seek treatment; 
clinicians may hesitate to prescribe antipsychotic medications when the diagnosis is uncertain 
because of potential side effects. Indeed, at the first manifestation of the disease, schizophrenia 
is difficult to distinguish from bipolar manic-depressive disorders, severe depression, drug- 
related disorders, and stress-related disorders. Since the optimum treatments differ among these 
diseases, the long term prognosis of the disorder also differs the beginning of the treatment. 

For both schizophrenia and bipolar disorder, all the known molecules used for the 
treatment of schizophrenia have side effects and act only against the symptoms of the disease. 
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There is a strong need for new molecules without associated side effects and directed against 
targets which are involved in the causa] mechanisms of schizophrenia and bipolar disorder. 
Therefore, tools facilitating the discovery and characterization of these targets are necessary and 
useful. 

The aggregation of schizophrenia and bipolar disorder in families, the evidence from 
twin and adoption studies, and the lack of variation in incidence worldwide, indicate that 
schizophrenia and bipolar disorder are primarily genetic conditions, although environmental 
risk factors are also involved at some level as necessary, sufficient, or interactive causes. For 
example, schizophrenia occurs in 1% of the general population. But, if there is one grandparent 
with schizophrenia, the risk of getting the illness increases to about 3%; one parent with 
Schizophrenia, to about 10%. When both parents have schizophrenia, the risk rises to 
approximately 40%. 

Consequently, there is a strong need to identify genes involved in schizophrenia and 
bipolar disorder. The knowledge of these genes will allow researchers to understand the 
etiology of schizophrenia and bipolar disorder and could lead to drugs and medications which 
are directed against the cause of the diseases, not just against their symptoms. 

There is also a great need for new methods for detecting a susceptibility to 
schizophrenia and bipolar disorder, as well as for preventing or following up the development 
of the disease. Diagnostic tools could also prove extremely useful. Indeed, early identification 
of subjects at risk of developing schizophrenia would enable early and/or prophylactic treatment 
to be administered. Moreover, accurate assessments of the eventual efficacy of a medicament 
as well as the patent's eventual tolerance to it may enable clinicians to enhance the benefit/risk 
ratio of schizophrenia and bipolar disorder treatment regimes. 

SUMMARY OF THE INVENTION 

The present invention stems from the identification of novel polymorphisms including 
biallelic markers located on the human chromosome I3q31-q33 locus, the identification and 
characterization of novel schizophrenia-related genes located on the human chromosome 
13q31-q33 locus, and from the identification of genetic associations between alleles of biallelic 
markers located on the human chromosome 13q3 l-q33 locus and disease, as confirmed and 
characterized in a panel of human subjects. The invention furthermore provides a fine structure 
map of the region which includes the schizophrenia-associated gene sequences. 

The present invention pertains to nucleic acid molecules comprising the genomic 
sequences of novel human genes encoding g35030 proteins, proteins encoded thereby, as well 
as antibodies thereto. The g35030 genomic sequences may also comprise regulatory sequence 
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located upstream (5'-end) and downstream (3'-end) of the transcribed portion of said gene, 
these regulatory sequences being also part of the invention. The invention also deals with the 
cDNA sequences encoding g35030 proteins. 

Oligonucleotide probes or primers hybridizing specifically with a g35030 genomic or 
cDNA sequence are also part of the present invention, as well as DNA amplification and 
detection methods using said primers and probes. 

A further object of the invention consists of recombinant vectors comprising any of the 
nucleic acid sequences described above, and in particular of recombinant vectors comprising a 
g35030 regulatory sequence or a sequence encoding a g3503O protein, as well as of ceil hosts 
and transgenic non human animals comprising said nucleic acid sequences or recombinant 
vectors. 

The invention also concerns to biallelic markers of the g35030 gene and the use thereof. 
Included are probes and primers for use in genotyping biallelic markers of the invention. 

An embodiment of the invention encompasses any polynucleotide of the invention 
attached to a solid support polynucleotide may comprise a sequence disclosed in the present 
specification; optionally, said polynucleotide may comprise, consist of, or consist essentially of 
any polynucleotide described in the present specification; optionally, said determining may be 
performed in a hybridization assay, sequencing assay, microsequencing assay, or an enzyme- 
based mismatch detection assay; optionally, said polynucleotide may be attached to a solid 
support, array, or addressable array; optionally, said polynucleotide may be labeled. 

Finally, the invention is directed to drug screening assays and methods for the screening 
of substances for the treatment of schizophrenia, bipolar disorder or a related CNS disorder. 
One object of the invention deals with animal models of schizophrenia, including mouse, 
primate, non-human primate bipolar disorder or related CNS disorder. The invention is also 
directed to methods for the screening of substances or molecules that modulate the expression 
of g35030, as well as with methods for the screening of substances or molecules that interact 
with a g35030 polypeptide, or that modulate the activity of a g35030 polypeptide. 

As noted above, certain aspects of the present invention stem from the identification of 
genetic associations between schizophrenia and bipolar disorder and alleles of biallelic markers 
located on the human chromosome 13q3 l-q33 region, and more particularly on a subregion 
thereof referred to herein as Region D. The invention provides appropriate tools for 
establishing further genetic associations between alleles of biallelic markers on the 13q31- 
13q33 locus and either side effects or benefit resulting from the administration of agents acting 
on schizophrenia or bipolar disorder, or schizophrenia or bipolar disorder symptoms, including 
agents like chlorpromazine, clozapine, risperidone, olanzapine, sertindole, quetiapine and 
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ziprasidone. 

The invention provides appropriate tools for establishing further genetic associations 
between alleles of biallelic markers on the 13q31-q33 locus and a trait. Methods and products 
are provided for the molecular detection of a genetic susceptibility in humans to schizophrenia 
and bipolar disorder. They can be used for diagnosis, staging, prognosis and monitoring of this 
disease, which processes can be further included within treatment approaches. The invention 
also provides for the efficient design and evaluation of suitable therapeutic solutions including 
individualized strategies for optimizing drug usage, and screening of potential new medicament 
candidates. 

Additional embodiments are set forth in the Detailed Description of the Invention and 
in the Examples. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is a diagram showing the exon structure of the g35030 gene. 
Figure 2 is a table demonstrating the statistical significance of allelic frequencies of 
selected chromosome 13q31-q33 biallelic markers of the invention in sporadic and familial 
French Canadian schizophrenia cases and controls. 

Figure 3 is a table demonstrating the results of a haplotype association analysis between 
total French Canadian schizophrenia cases and haplotypes which consist of chromosome 13q31- 
q33 biallelic markers of the invention. 

Figure 4 is a table showing the involvement of selected biallelic markers of the 
invention in statistically significant haplotypes. 

Figure 5 is a table demonstrating the results of a haplotype association analysis between 
French Canadian schizophrenia cases and haplotypes which consist of chromosome 13q31-q33 
biallelic markers of the invention. 

Figure 6 is a table demonstrating the results of a haplotype association analysis between 
French Canadian schizophrenia cases and haplotypes which consist of chromosome 13q31-q33 
biallelic markers of the invention. 

Figures 7A and 7B show the results of a haplotype association analysis (Omnibus LR 
test value distribution) between schizophrenia cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figures 8A and 8B show the results of a haplotype association analysis (HaplotMaxM 
test value distribution) between schizophrenia cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figures 9A and 9B show the results of a haplotype association analysis (Omnibus LR 
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test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figures 1 OA and 10B show the results of a haplotype association analysis (HapIoMaxM 
test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figures 1 1 A and 1 1 B show the results of a haplotype association analysis (HaploMaxS 
test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figure 12 is a block diagram of an exemplary computer system. 

Figure 13 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
new nucleotide or protein sequence with a database of sequences in order to determine the 
homology levels between the new sequence and the sequences in the database. 

Figure 14 is a flow diagram illustrating one embodiment of a process 250 in a computer 
for determining whether two sequences are homologous. 

Figure 15 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. 

BRIEF DESCR IPTION OF THE SEQUENCES PROVIDED IN TffF 
SEQUENCE LISTTNC 

SEQ ID No. 1 contains the approximately 3 19kb of genomic nucleotide sequence 
comprising g35030 nucleic acid sequences and the biallelic markers Al to A71 located on the 
human chromosome I3q31-q33 locus. 

SEQ ID Nos. 2 to 17 contain cDNA sequences of the g35030 gene. 

SEQ ID Nos, 1 8 to 23 contain amino acid sequences of g35030 polypeptides, encoded 
by cDNAs of the invention. 

SEQ ID Nos. 24 to 78 respectively contain the nucleotide sequence of the amplicons 
which comprise the biallelic markers A72 to A 127 located on the human chromosome 13q3 1- 
q33 locus. 

SEQ ID No. 79 to 132 contain genomic nucleotide sequences comprising exons of the 
g35030 gene from several different primates. 

SEQ ID No 133 contains a primer containing the additional PU 5* sequence described 
further in Example 2. 

SEQ ID No 134 contains a primer containing the additional RP 5' sequence described 
fiirther in Example 2. 
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In accordance with the regulations relating to Sequence Listings, the following codes 
have been used in the Sequence Listing to indicate the locations of biallelic markers within the 
sequences and to identify each of the alleles present at the polymorphic base. The code "r" in 
the sequences indicates that one allele of the polymorphic base is a guanine, while the other 
allele is an adenine. The code "y" in the sequences indicates that one allele of the polymorphic 
base is a thymine, while the other allele is a cytosine. The code "m" in the sequences indicates 
that one allele of the polymorphic base is an adenine, while the other allele is an cytosine. The 
code "k" in the sequences indicates that one allele of the polymorphic base is a guanine, while 
the other allele is a thymine. The code "s" in the sequences indicates that one allele of the 
polymorphic base is a guanine, while the other allele is a cytosine. The code "w" in the 
sequences indicates that one allele of the polymorphic base is an adenine, while the other allele 
is an thymine. 

DETAILED DESCRIPTION OF THE INVENTION 

The identification of genes involved in a particular trait such as a specific central 
nervous system disorder, like schizophrenia, can be carried out through two main strategies 
currently used for genetic mapping: linkage analysis and association studies. Linkage analysis 
requires the study of families with multiple affected individuals and is now useful in the 
detection of mono- or oligogenic inherited traits. Conversely, association studies examine the 
frequency of marker alleles in unrelated trait (T+) individuals compared with trait negative (T-) 
controls, and are generally employed in the detection of polygenic inheritance. 
Candidate region on the chromosome 13 (linkage analysis) 
Genetic link or "linkage" is based on an analysis of which of two neighboring 
sequences on a chromosome contains the least recombinations by crossing-over during meiosis. 
To do this, chromosomal markers, like microsatellite markers, have been localized with 
precision on the genome. Genetic link analysis calculates the probabilities of recombinations on 
the target gene with the chromosomal markers used, according to the genealogical tree, the 
transmission of the disease, and the transmission of the markers. Thus, if a particular allele of a 
given marker is transmitted with the disease more often than chance would have it 
(recombination level between 0 and 0.5), it is possible to deduce that the target gene in question 
is found in the neighborhood of the marker. 

Using this technique, it has been possible to localize several genes demonstrating a 
genetic predisposition of familial cancers. In order to be able to be included in a genetic link 
study, the families affected by a hereditary form of the disease must satisfy the 
"informativeness" criteria: several affected subjects (and whose constitutional DNA is 
available) per generation, and at best having a large number of siblings. 
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By linkage analysis, observations have been made, according to which a candidate 
region for schizophrenia is present on chromosome 13q32 locus (Blouin et al., 1998). Linkage 
analysis has been successfully applied to map simple genetic traits that show clear Mendelian 
inheritance patterns and which have a high penetrance, but this method suffers from a variety of 
drawbacks. First, linkage analysis is limited by its reliance on the choice of a genetic model 
suitable for each studied trait. Furthermore, the resolution attainable using linkage analysis is 
limited, and complementary studies are required to refine the analysis of the typical 20 Mb 
regions initially identified through this method. In addition, linkage analysis have proven 
difficult when applied to complex genetic traits, such as those due to the combined action of 
multiple genes and/or environmental factors. In such cases, too great an effort and cost are 
needed to recruit the adequate number of affected families required for applying linkage 
analysis to these situations. Finally, linkage analysis cannot be applied to the study of traits for 
which no large informative families are available. 

In the present invention alternative means for conducting association studies rather than 
linkage analysis between markers located on the chromosome 13q31-q33 locus and a trait, 
preferably schizophrenia or bipolar disorder, are disclosed. 

In the present application, additional biallelic markers located on the human 
chromosome 13q3 l-q33 locus associated with schizophrenia are disclosed. The identification 
of these biallelic markers in association with schizophrenia has allowed for the further 
definition of the chromosomal region suspected of containing a genetic determinant involved in 
a predisposition to develop schizophrenia and has resulted in the identification of novel gene 
sequences disclosed herein which are associated with a predisposition to develop schizophrenia. 
The present invention thus provides an extensive fine structure map of the 13q31-q33 locus, 
including novel biallelic markers located on the human 13q31-q33 locus, approximately 31 9kb 
of genomic nucleotide sequence of a subregion of the human 13q3 l-q33 locus, and 
polymorphisms including biallelic markers and nucleotide deletions in said 3 19kb genomic 
sequence. The biallelic markers of the human chromosome 13q3 l-q33 locus and the nucleotide 
sequences, polymorphisms and gene sequences located in Region D subregion of the human 
chromosome 13q3I-q33 locus are useful as genetic and physical markers for further mapping 
studies. These identified polymorphisms are used in the design of assays for the reliable 
detection of genetic susceptibility to schizophrenia and bipolar disorder. They can also be used 
in the design of drug screening protocols to provide an accurate and efficient evaluation of the 
therapeutic and side-effect potential of new or already existing medicament or treatment regime. 

Using the biallelic markers, the inventors have identified the g35030 candidate 
schizophrenia gene, further described herein. The sequences comprising the the schizophrenia- 
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associated genes are useful, for example, for the isolation of other genes in putative gene 
families, the identification of homologs from other species, for the development of medicaments 
for treatment of disease, and as probes and primers for diagnostic or screening assays as 
described herein. 



Definitions 

As used interchangeably herein, the term " oligonucleotides ", and " polynucleotides " 
include RNA, DNA, or RNA/DNA hybrid sequences of more than one nucleotide in either 
single chain or duplex form. The term " nucleotide " as used herein as an adjective to describe 
molecules comprising RNA, DNA, or RNA/DNA hybrid sequences of any length in single- 
stranded or duplex form. The term "nucleotide" is also used herein as a noun to refer to 
individual nucleotides or varieties of nucleotides, meaning a molecule, or individual unit in a 
larger nucleic acid molecule, comprising a purine or pyrimidine, a ribose or deoxyribose sugar 
moiety, and a phosphate group, or phosphodiester linkage in the case of nucleotides within an 
oligonucleotide or polynucleotide. Although the term "nucleotide" is also used herein to 
encompass "modified nucleotides" which comprise at least one modifications (a) an alternative 
linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or (d) an 
analogous sugar, for examples of analogous linking groups, purine, pyrimidines, and sugars see 
for example PCT publication No. WO 95/04064. However, the polynucleotides of the invention 
are preferably comprised of greater than 50% conventional deoxyribose nucleotides, and most 
preferably greater than 90% conventional deoxyribose nucleotides. The polynucleotide 
sequences of the invention may be prepared by any known method, including synthetic, 
recombinant, ex vivo generation, or a combination thereof, as well as utilizing any purification 
methods known in the art. 

As used herein, the term "purified" does not require absolute purity; rather, it is 
intended as a relative definition. Purification of starting material or natural material is at least 
one order of magnitude, preferably two or three orders, and more preferably four or five orders 
of magnitude is expressly contemplated. As an example, purification from 0.1 % concentration 
to 1 0 % concentration is two orders of magnitude. 

To illustrate, individual cDNA clones isolated from a cDNA library have been 
conventionally purified to electrophoretic homogeneity. The sequences obtained from these 
clones could not be obtained directly either from the library or from total human DNA. The 
cDNA clones are not naturally occurring as such, but rather are obtained via manipulation of a 
partially purified naturally occurring substance (messenger RNA). The conversion of mRNA 
into a cDNA library involves the creation of a synthetic substance (cDNA) and pure individual 
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cDNA clones can be isolated from the synthetic library by clonal selection. Thus, creating a 
cDNA library from messenger RNA and subsequently isolating individual clones from that 
library results in an approximately 10 4 -10 6 fold purification of the native message. 

The term "purified" is further used herein to describe a polypeptide or polynucleotide of 
the invention which has been separated from other compounds including, but not limited to, 
polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to 
specify the separation of monomeric polypeptides of the invention from oligomeric forms such 
as homo- or hetero- dimers, trimers, etc. The term "purfled" may also be used to specify the 
separation of covalently closed polynucleotides from linear polynucleotides. A polynucleotide 
is substantially pure when at least about 50%, preferably 60 to 75% of a sample exhibits a single 
polynucleotide sequence and conformation (linear versus covalently close). A substantially 
pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to 90% 
weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about 
95%, and preferably is over about 99% pure. Polypeptide and polynucleotide purity, or 
homogeneity, is indicated by a number of means well known in the art, such as agarose or 
polyacrylamide gel electrophoresis of a sample, followed by visualizing a single band upon 
staining the gel. For certain purposes higher resolution can be provided by using HPLC or other 
means well known in the art. As an alternative embodiment, purification of the polypeptides and 
polynucleotides of the present invention may be expressed as "at least" a percent purity relative to 
heterologous polypeptides and polynucleotides (DNA, RNA or both). As a preferred embodiment, 
the polypeptides and polynucleotides of the present invention are at least; 10%, 20%, 30%, 40%, 
50%, 60%, 70%, 80%, 90%, 95%, 96%, 96%, 98%, 99%, or 100% pure relative to heterologous 
polypeptides and polynucleotides, respectively. As a further preferred embodiment the 
polypeptides and polynucleotides have a purity ranging from any number, to the thousandth 
position, between 90% and 100% (e.g., a polypeptide or polynucleotide at least 99.995% pure) 
relative to either heterologous polypeptides or polynucleotides, respectively, or as a weight/weight 
ratio relative to all compounds and molecules other than those existing in the carrier. Each 
number representing a percent purity, to the thousandth position, may be claimed as individual 
species of purity. 

The term "isolated" requires that the material be removed from its original environment 
(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials in 
the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 
polynucleotide or polypeptide could be part of a composition, and still be isolated in that the 
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vector or composition is not part of its natural environment. 

The term "primer" denotes a specific oligonucleotide sequence which is complementary 
to a target nucleotide sequence and used to hybridize to the target nucleotide sequence. A 
primer serves as an initiation point for nucleotide polymerization catalyzed by either DNA 
polymerase, RNA polymerase or reverse transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide analog 
segment, e.g., polynucleotide as defined herein) which can be used to identify a specific 
polynucleotide sequence present in samples, said nucleic acid segment comprising a nucleotide 
sequence complementary of the specific polynucleotide sequence to be identified. 

The terms "trait" and " phenotvpe " are used interchangeably herein and refer to any 
clinically distinguishable, detectable or otherwise measurable property of an organism such as 
symptoms of, or susceptibility to a disease for example. Typically the terms "trait" or 
"phenotype" are used herein to refer to symptoms of, or susceptibility to schizophrenia or 
bipolar disorder; or to refer to an individual's response to an agent acting on schizophrenia or 
bipolar disorder; or to refer to symptoms of, or susceptibility to side effects to an agent acting 
on schizophrenia or bipolar disorder. 

The term " allele " is used herein to refer to variants of a nucleotide sequence. A 
biallelic polymorphism has two forms. Typically the first identified allele is designated as the 
original allele whereas other alleles are designated as alternative alleles. Diploid organisms 
may be homozygous or heterozygous for an allelic form. 

The term "heterozygosity rate" is used herein to refer to the incidence of individuals in 
a population, which are heterozygous at a particular allele. In a biallelic system the 
heterozygosity rate is on average equal to 2P a (l-P a ), where P a is the frequency of the least 
common allele. In order to be useful in genetic studies a genetic marker should have an 
adequate level of heterozygosity to allow a reasonable probability that a randomly selected 
person will be heterozygous. 

The term "genotype" as used herein refers the identity of the alleles present in an 
individual or a sample. In the context of the present invention a genotype preferably refers to 
the description of the biallelic marker alleles present in an individual or a sample. The term 
"genotyping" a sample or an individual for a biallelic marker involves determining the specific 
allele or the specific nucleotide(s) carried by an individual at a biallelic marker. 

The term " mutation " as used herein refers to a difference in DNA sequence between or 
among different genomes or individuals which has a frequency below 1%. 

The term " haplotvpe " refers to a combination of alleles present in an individual or a 
sample on a single chromosome. In the context of the present invention a haplotype preferably 
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refers to a combination of biallelic marker alleles found in a given individual and which may be 
associated with a phenotype. 

The term "polymorphism" as used herein refers to the occurrence of two or more 
alternative genomic sequences or alleles between or among different genomes or individuals. 
"Polymorphic" refers to the condition in which two or more variants of a specific genomic 
sequence can be found in a population. A " polymorphic site " is the locus at which the variation 
occurs. A polymorphism may comprise a substitution, deletion or insertion of one or more 
nucleotides. A single nucleotide polymorphism is a single base pair change. Typically a single 
nucleotide polymorphism is the replacement of one nucleotide by another nucleotide at the 
polymorphic site. Deletion of a single nucleotide or insertion of a single nucleotide, also give 
rise to single nucleotide polymorphisms. In the context of the present invention "single 
nucleotide polymorphism" preferably refers to a single nucleotide substitution. Typically, 
between different genomes or between different individuals, the polymorphic site may be 
occupied by two different nucleotides. 

The terms "biallelic pol ymorphism " and " biallelic marker " are used interchangeably 
herein to refer to a polymorphism having two alleles at a fairly high frequency in the 
population, preferably a single nucleotide polymorphism. A " biallelic marker allele " refers to 
the nucleotide variants present at a biallelic marker site. Typically the frequency of the less 
common allele of the biallelic markers of the present invention has been validated to be greater 
than 1%, preferably the frequency is greater than 10%, more preferably the frequency is at least 
20% (i.e. heterozygosity rate of at least 0.32), even more preferably the frequency is at least 
30% (i.e. heterozygosity rate of at least 0.42). A biallelic marker wherein the frequency of the 
less common allele is 30% or more is termed a " high quality biallelic marker ." All of the 
genotyping, haplotyping, association, and interaction study methods of the invention may 
optionally be performed solely with high quality biallelic markers. 

The location of nucleotides in a polynucleotide with respect to the center of the 
polynucleotide are described herein in the following manner. When a polynucleotide has an 
odd number of nucleotides, the nucleotide at an equal distance from the 3' and 5' ends of the 
polynucleotide is considered to be " at the center " of the polynucleotide, and any nucleotide 
immediately adjacent to the nucleotide at the center, or the nucleotide at the center itself is 
considered to be " within 1 nucleotide of the center ." With an odd number of nucleotides in a 
polynucleotide any of the five nucleotides positions in the middle of the polynucleotide would 
be considered to be within 2 nucleotides of the center, and so on. When a polynucleotide has an 
even number of nucleotides, there would be a bond and not a nucleotide at the center of the 
polynucleotide. Thus, either of the two central nucleotides would be considered to be "within 1 
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nucleotide of the center" and any of the four nucleotides in the middle of the polynucleotide 
would be considered to be "within 2 nucleotides of the center", and so on. For polymorphisms 
which involve the substitution, insertion or deletion of 1 or more nucleotides, the 
polymorphism, allele or biallelic marker is "at the center" of a polynucleotide if the difference 
between the distance from the substituted, inserted, or deleted polynucleotides of the 
polymorphism and the 3' end of the polynucleotide, and the distance from the substituted, 
inserted, or deleted polynucleotides of the polymorphism and the 5' end of the polynucleotide is 
zero or one nucleotide. If this difference is 0 to 3, then the polymorphism is considered to be 
"within 1 nucleotide of the center." If the difference is 0 to 5, the polymorphism is considered 
to be "within 2 nucleotides of the center." If the difference is 0 to 7, the polymorphism is 
considered to be "within 3 nucleotides of the center," and so on. For polymorphisms which 
involve the substitution, insertion or deletion of 1 or more nucleotides, the polymorphism, allele 
or biallelic marker is "at the center" of a polynucleotide if the difference between the distance 
from the substituted, inserted, or deleted polynucleotides of the polymorphism and the 3' end of 
the polynucleotide, and the distance from the substituted, inserted, or deleted polynucleotides of 
the polymorphism and the 5' end of the polynucleotide is zero or one nucleotide. If this 
difference is 0 to 3, then the polymorphism is considered to be "within 1 nucleotide of the 
center." If the difference is 0 to 5, the polymorphism is considered to be "within 2 nucleotides 
of the center." If the difference is 0 to 7, the polymorphism is considered to be "within 3 
nucleotides of the center," and so on. 

The term "upstream" is used herein to refer to a location which, is toward the 5* end of 
the polynucleotide from a specific reference point. 

The terms "base paired" and " Watson & Crick base p aired" are used interchangeably 
herein to refer to nucleotides which can be hydrogen bonded to one another be virtue of their 
sequence identities in a manner like that found in double-helical DNA with thymine or uracil 
residues linked to adenine residues by two hydrogen bonds and cytosine and guanine residues 
linked by three hydrogen bonds (See Stryer, L., Biochemistry, 4th edition, 1995). 

The terms "complementary" or " complement thereof are used herein to refer to the 
sequences of polynucleotides which is capable of forming Watson & Crick base pairing with 
another specified polynucleotide throughout the entirety of the complementary region. This 
term is applied to pairs of polynucleotides based solely upon their sequences and not any 
particular set of conditions under which the two polynucleotides would actually bind. 

The terms "g35030 gene ", when used herein, encompasses genomic, mRNA and 
cDNA sequences encoding a g35030 protein, including the untranslated regulatory regions of 
the genomic DNA. 
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As used herein the term " 13o3I-o33-related biallelic marker " relates to a biallelic 
marker residing in the human chromosome 13q3 l-q33 region. The term 13q31-q33-related 
biallelic marker encompasses all of the biallelic markers disclosed in Table 5 and any biallelic 
markers in linkage disequilibrium therewith, and any biallelic markers in linkage disequilibrium 
therewith. The preferred chromosome 13q31-q33-related biallelic marker alleles of the present 
invention include each one the alleles described in Table 5 individually or in groups consisting 
of all the possible combinations of the alleles listed. 

As used herein the term < 'g35030-relate d biallelic marker " relates to a set of biallelic 
markers in linkage disequilibrium with the g35030 gene or a g35O30 nucleotide sequence. The 
term g35030-related biallelic marker encompasses the biallelic markers A 13 to A65 disclosed in 
Table 5 and any biallelic markers in linkage disequilibrium therewith. 

The term "polypeptide" refers to a polymer of amino acids without regard to the length 
of the polymer; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not specify or exclude prost-expression modifications of 
polypeptides, for example, polypeptides which include the covalent attachment of glycosyl 
groups, acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed 
by the term polypeptide. Also included within the definition are polypeptides which contain 
one or more analogs of an amino acid (including, for example, non-naturally occurring amino 
acids, amino acids which only occur naturally in an unrelated biological system, modified 
amino acids from mammalian systems etc.), polypeptides with substituted linkages, as well as 
other modifications known in the art, both naturally occurring and non-naturally occurring. 

As used herein, the term "non-human animal" refers to any non-human vertebrate, birds 
and more usually mammals, preferably primates, farm animals such as swine, goats, sheep, 
donkeys, and horses, rabbits or rodents, more preferably rats or mice. As used herein, the term 
"animal" is used to refer to any vertebrate, preferable a mammal. Both the terms " animal " and 
" mammal " expressly embrace human subjects unless preceded with the term "non-human". 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
which are comprised of at least one binding domain, where an antibody binding domain is 
formed from the folding of variable domains of an antibody molecule to form three-dimensional 
binding spaces with an internal surface shape and charge distribution complementary to the 
features of an antigenic determinant of an antigen., which allows an immunological reaction 
with the antigen. Antibodies include recombinant proteins comprising the binding domains, as 
wells as fragments, including Fab, Fab', F(ab)2, and F(ab')2 fragments. 

As used herein, an "antigenic determinant" i s the portion of an antigen molecule, in this 
case a g35030 polypeptide, that determines the specificity of the antigen-antibody reaction. An 
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"epitope" refers to an antigenic determinant of a polypeptide. An epitope can comprise as few 
as 3 amino acids in a spatial conformation which is unique to the epitope. Generally an epitope 
comprises at least 6 such amino acids, and more usually at least 8-10 such amino acids. 
Methods for determining the amino acids which make up an epitope include x-ray 
crystallography, 2-dimensional nuclear magnetic resonance, and epitope mapping e.g. the 
Pepscan method described by Geysen et al. 1984; PCT Publication No. WO 84/03564; and 
PCT Publication No. WO 84/03506. 



Variants and Fragments 

The invention also relates to variants and fragments of the polynucleotides described 
herein, particularly of a nucleotide sequence of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
132, and particularly of a nucleotide sequence of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
1 32 containing one or more biallelic markers and/or other polymorphisms according to the 
invention. 

Variants of polynucleotides, as the term is used herein, are polynucleotides that differ 
from a reference polynucleotide. A variant of a polynucleotide may be a naturally occurring 
variant such as a naturally occurring allelic variant, or it may be a variant that is not known to 
occur naturally. Such non-naturally occurring variants of the polynucleotide may be made by 
mutagenesis techniques, including those applied to polynucleotides, cells or organisms. 
Generally, differences are limited so that the nucleotide sequences of the reference and the 
variant are closely similar overall and, in many regions, identical. 

Variants of polynucleotides according to the invention include, without being limited 
to, nucleotide sequences which are at least 95% identical to a polynucleotide selected from the 
group consisting of the nucleotide sequences SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 
or to any polynucleotide fragment of at least 8 consecutive nucleotides of a polynucleotide 
selected from the group consisting of the nucleotide SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 
to 132, and preferably at least 99% identical, more particularly at least 99.5% identical, and 
most preferably at least 99.8% identical to a polynucleotide selected from the group consisting 
of the nucleotide SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32 or to any polynucleotide 
fragment of at least 30, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000, to the extent that the 
length is consistent with the particular sequence ID, consecutive nucleotides of a polynucleotide 
selected from the group consisting of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 
28, 74 and 79 to 132. 

Nucleotide changes present in a variant polynucleotide may be silent, which means that 
they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes 
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may also result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may 
involve one or more nucleotides. The variants may be altered in coding or non-coding regions 
or both. Alterations in the coding regions may produce conservative or non-conservative amino 
acid substitutions, deletions or additions. 

A polynucleotide fragment is a polynucleotide having a sequence that is entirely the 
same as part but not all of a given nucleotide sequence, preferably the nucleotide sequence of a 
g35030 polynucleotide, and variants thereof, or of a polynucleotide of any of SEQ ID Nos 1 to 
17, 24, 25, 28, 74 and 79 to 132, or a polynucleotide comprising one of the biallelic markers A I 
to A 127, or the complements thereof. Such fragments may be "free-standing", i.e. not part of or 
fused to other polynucleotides, or they may be comprised within a single larger polynucleotide 
of which they form a part or region. Indeed, several of these fragments may be present within a 
single larger polynucleotide. Optionally, such fragments may comprise, consist of, or consist 
essentially of a contiguous span of at least 8, 10, 12, 1 5, 1 8, 20, 25, 30, 35, 40, 50, 70, 80, 100, 
250, 500 , 1 000 or 2000 nucleotides in length of any of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 
79 to 132. 

Identity Between Nucleic Acids Or Polypeptides 

The terms "percentage of sequence identity" and " percentage homolo g y " are used 
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and 
are determined by comparing two optimally aligned sequences over a comparison window, 
wherein the portion of the polynucleotide or polypeptide sequence in the comparison window 
may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which 
does not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical nucleic 
acid base or amino acid residue occurs in both sequences to yield the number of matched 
positions, dividing the number of matched positions by the total number of positions in the 
window of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. Homology is evaluated using any of the variety of sequence comparison algorithms 
and programs known in the art. Such algorithms and programs include, but are by no means 
limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman, 
1988, Proc. Natl. Acad. Sci. USA 85(8):2444-2448; Altschul et al., 1990, J. Mol. Biol. 
215(3):403-410; Thompson et al., 1994, Nucleic Acids Res. 22(2):4673-4680; Higgins et al., 
1996, Methods Enzymol. 266:383-402; Altschul et al., 1990, J. Mol. Biol. 215(3):403-410; 
Altschul et al., 1993, Nature Genetics 3:266-272). In a particularly preferred embodiment, 
protein and nucleic acid sequence homologies are evaluated using the Basic Local Alignment 
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Search Tool ("BLAST") which is well known in the art (see, e.g., Karlin and Altschul, 1990, 
Proc. Natl. Acad. Sci. USA 87:2267-2268; Altschul et al., 1990, J. Mol. Biol. 215:403-410; 
Altschul et al, 1993, Nature Genetics 3:266-272; Altschul et al, 1 997, Nuc. Acids Res. 
25:3389-3402). In particular, five specific BLAST programs are used to perform the following 
task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein 
sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
database; 

(3) BLASTX compares the six-frame conceptual translation products of a query 
nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence 
database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence 
against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, 
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic 
acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid 
sequence database. High-scoring segment pairs are. preferably identified (i.e., aligned) by 
means of a scoring matrix, many of which are known in the art. Preferably, the scoring matrix 
used is the BLOSUM62 matrix (Gonnet et al, 1992, Science 256: 1443-1445; Henikoffand 
HenikofT, 1993, Proteins 17:49-61). Less preferably, the PAM or PAM250 matrices may also 
be used (see, e.g., Schwartz and Dayhoff, eds, 1978, Matrices for Detecting Distance 
Relationships: Atlas of Protein Sequence and Structure, Washington: National Biomedical 
Research Foundation). The BLAST programs evaluate the statistical significance of all high- 
scoring segment pairs identified, and preferably selects those segments which satisfy a user- 
specified threshold of significance, such as a user-specified percent homology. Preferably, the 
statistical significance of a high-scoring segment pair is evaluated using the statistical 
significance formula of Karlin (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 
87:2267-2268). 

The BLAST programs may be used with the default parameters or with modified 
parameters provided by the user. 



Stringent Hybridization Conditions 

By way of example and not limitation, procedures using conditions of high stringency 
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are as follows: prehybridization of filters containing DNA is carried out for 8 h to overnight at 
65°C in buffer composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 
0.02% Ficoll, 0.02% BSA, and 500 ng/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture 
containing 100 ng/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-Iabeled probe. 
Subsequently, filter washes can be done at 37°C for 1 h in a solution containing 2 x SSC, 0.01% 
PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1 X SSC at 50°C for 45 min. 
Following the wash steps, the hybridized probes are detectable by autoradiography. Other 
conditions of high stringency which may be used are well known in the art and as cited in 
Sambrook et al., 1989; and Ausubel et al., 1989. These hybridization conditions are suitable 
for a nucleic acid molecule of about 20 nucleotides in length. There is no need to say that the 
hybridization conditions described above are to be adapted according to the length of the 
desired nucleic acid, following techniques well known to the one skilled in the art. The suitable 
hybridization conditions may for example be adapted according to the teachings disclosed in the 
book of Hames and Higgins (1985) or in Sambrook et ai.(1989). 

Genomic Sequences of g35030 polynucleotides 

The present invention concerns genomic DNA sequences of the g35030 gene. As 
referred to herein, genomic sequences of g35030 are indicated by nucleotide position in the 5' to 
3' orientation on SEQ ED No I. 

Particularly preferred nucleic acids of the invention include isolated, purified, or 
recombinant polynucleotides comprising, consisting essentially of, or consisting of a contiguous 
span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, or 1000 
nucleotides of nucleotide positions 201 123 to 247802 of SEQ ID No 1, or the complements 
thereof. Also encompassed are purified, isolated, or recombinant polynucleotides comprising a 
nucleotide sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with 
nucleotide positions 201 123 to 247802 of SEQ ID No 1, or a complementary sequence thereto 
or a fragment thereof. Also encompassed are purified, isolated, or recombinant nucleic acids 
that hybridize with said span under stringent hybridization conditions as defined above. Nucleic 
acids of the invention encompass g35030 nucleic acids from any source, including primate, 
non-human primate, mammalian and human. 

The human g35030 gene comprises exons selected from at least 14 different exons or 
exon forms, referred to herein as exons S, S2, T, U, V, VI, V2, V3, V4, W, W2, X, Y, Z. As 
indicated in Table 1, exons V and V2 are considered alternative exons, containing overlapping 
sequence. Likewise, exon W2 contains overlapping sequence with exons W and X. 
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The nucleotide positions of g35030 exons in SEQ ID No. 1 are detailed below in Table 
1 . The exon structure of the g35030 gene is further shown in Figure 1 . 

Table 1 



Exon 


Position in SEQID1 


Intron Position in SEQID1 


S 


201123 


201234 


201235 


20 119? 


S2 


201123 


201560 


201561 


Z. J HO / J 


T 


214676 


214793 


214794 




U 


215702 


215746 


215747 




V 


216836 


216994 


216995 


Z 1 /o/u 


V2 


216836 


217077 


217078 


217670 


VI 


217671 


217764 


217765 


227654 


V4 


227655 


227736 


227737 


238714 


V3 


238715 


238919 


238920 


240439 


W 


240440 


240673 


240674 


244352 or 241071 


W2 


240440 


241153 


241154 


244352 


X 


241072 


241291 


241292 


244352 


Y 


244353 


244561 


244562 


246272 


Z 


246273 


247802 







Thus, the invention embodies purified, isolated, or recombinant polynucleotides 
comprising a nucleotide sequence selected from the group consisting of the exons of theg35030 
gene, or a sequence complementary thereto. Preferred are purified, isolated, or recombinant 
polynucleotides comprising at least one exon having the nucleotide position ranges listed in 
Table 1 selected from the group consisting of the exons S, S2, T, U, V, VI, V2, V3, V4, W, 
W2, X, Y, Z of the g35030 gene, or a complementary sequence thereto or a fragment or a 
variant thereof. Also encompassed by the invention are purified, isolated, or recombinant 
nucleic acids comprising a combination of at least two exons of the g35030 gene selected from 
the group consisting of exons S, S2, T, U, V, VI , V2, V3, V4, W, W2, X, Y, Z, wherein the 
polynucleotides are arranged within the nucleic acid in the same relative order as in SEQ ID No. 
1. 

Particularly preferred nucleic acids of the invention include isolated, purified, or 
recombinant polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, 90, 100 or 200 nucleotides of SEQ ID No 1, wherein said 
contiguous span comprises at least 1 , 2, 3, 5, or 10 of the following nucleotide positions of SEQ 
ID No 1 : 20 1 1 23 to 20 1 234, 20 1 1 23 to 20 1 560, 2 1 4676 to 2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 
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to 216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 
240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 
247802, or the complements thereof. 

Another object of the invention consists of a purified, isolated, or recombinant nucleic 
acid that hybridizes with a g35030 nucleotide sequence of nucleotide positions 201 123 to 
201234, 201123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 24 11 53, 24 1 072 to 24 1 29 1 , 244353 to 24456 1 and 246273 to 247802 of SEQ ID No 
1, or a complementary sequence thereto or a variant thereof, under stringent hybridization 
conditions as defined above. 

The invention also encompasses a purified, isolated, or recombinant polynucleotide 
comprising a nucleotide sequence of g35030 having at least 70, 75, 80, 85, 90, or 95% 
nucleotide identity with a sequence selected from the group consisting of nucleotide positions 
20 1 1 23 to 20 1 234, 20 1 1 23 to 20 1 560, 2 1 4676 to 2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 to 
216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 
to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 247802 of 
SEQ ID No. 1 or a complementary sequence thereto or a fragment thereof. The nucleotide 
differences as regards the nucleotide positions 201 123 to 201234, 201 123 to 201560, 214676 to 
2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 767 1 to 2 1 7764, 227655 
to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 
244353 to 244561 and 246273 to 247802 of SEQ ID No. 1 may generally be distributed 
throughout the nucleic acid. 

The present invention further embodies purified, isolated, or recombinant 
polynucleotides comprising a nucleotide sequence selected from the group consisting of the 
introns of the g35030 gene, or a sequence complementary thereto. The present invention thus 
encompasses the g35030 gene as well as g35030 genomic sequences consisting of, consisting 
essentially of, or comprising the sequence of nucleotide positions 201235 to 201 122, 201561 to 
214675,214794 to 215701, 215747 to 216835, 216995 to 217670, 217078 to 217670, 217765 
to 227654, 227737 to 238714, 238920 to 240439, 240674 to 244352, 240674 to 241071, 
241 154 to 244352, 241292 to 244352 and 244562 to 246272 of SEQ ID No 1, a sequence 
complementary thereto, as well as fragments and variants thereof. 

These nucleic acids, as well as their fragments and variants, may be used as 
oligonucleotide primers or probes in order to detect the presence of a copy of a gene comprising 
a g35030 nucleic acid sequence in a test sample, or alternatively in order to amplify a target 
nucleotide sequence within a g35030 nucleic acid sequence or adjoining region. 
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Additional preferred nucleic acids of the invention include isolated, purified, or 
recombinant g35O30 polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 
25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 90, 100 or 200 nucleotides of nucleotide positions 
201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 
to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 
201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No 
1, or the complements thereof, wherein said contiguous span comprises at least one biallelic 
marker. Optionally, said contiguous span comprises a g35030-related biallelic marker. It should 
be noted that nucleic acid fragments of any size and sequence may also be comprised by the 
polynucleotides described in this section. Either the original or the alternative allele may be 
present at said biallelic marker. 

As noted above, g35030 gene sequence are indicated by position on SEQ ID Nol . 
Accordingly, in certain embodiments, a g35030 nucleotide sequence may also comprise 
isolated, purified, or recombinant polynucleotides comprising a contiguous span of at least 12, 
15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 2000 nucleotides of 
nucleotide positions 3 1 to 3 19608 of SEQ ID No. 1, or the complements thereof. Optionally, 
said contiguous span comprises a biallelic marker selected from the group consisting of A13 to 
Al 8, A20 to A47, A50 to A53, A56, A58, and A60 to A64. Optionally allele 2 is present at the 
biallelic marker. It should be noted that nucleic acid fragments of any size and sequence may 
be comprised by the polynucleotides described in this section. 

Further nucleic acids of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 1 2, 1 5, 1 8, 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90, 100, 1 50, 200, 300, 400 or 500 nucleotides, to the extent that the length of said span 
is consistent with the length of the SEQ ID, of SEQ ID Nos. 24, 25, 28 and 74. The invention 
also encompasses a purified, isolated, or recombinant polynucleotide comprising a nucleotide 
sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity to said span, as well as a 
purified, isolated, or recombinant nucleic acid that hybridizes with a said span under the 
stringent hybridization conditions as defined above. Optionally, said contiguous span comprises 
ar biallelic marker selected from the group consisting of A73 to A74, A77 and A124. 
Optionally allele 2 is present at the biallelic marker. It should be noted that nucleic acid 
fragments of any size and sequence may be comprised by the polynucleotides described in this 
section. 

Yet further nucleic acids of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
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70, 80, 90, 100, 1 50, 200, 500 or 1000 nucleotides, to the extent that said span is consistent with 
the nucleotide position range, of SEQ ID No 1, wherein said contiguous span comprises at least 
1,2, 3, 5, or 10 of the following nucleotide positions of SEQ ID No 1: 201 188 to 201234, 
2 1 6836 to 2 1 69 1 5 and 246273 to 24778 1 , or the complements thereof, as well as 
polynucleotides having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with said span, 
and polynucleotides capable of hybridizing with said span. In other embodiments, said 
contiguous span may specifically exclude one or more of the following nucleotide positions of 
SEQ ID No. 1 : 20 1 1 88 to 20 1 234, 2 1 6836 to 2 1 69 1 5, 2 1 4676 to 2 1 4793 and 2 1 5702 to 
215746. 

The present invention also comprises a purified or isolated nucleic acid encoding a 
g35030 protein having the amino acid sequence of any one of SEQ ID Nos 18 to 23 or a peptide 
fragment or variant thereof. 

It should be noted that nucleic acid fragments of any size and sequence may also be 
comprised by the polynucleotides described in this section. 

While this section is entitled "Genomic Sequences of g35030," it should be noted that 
nucleic acid fragments of any size and sequence may also be comprised by the polynucleotides 
described in this section, flanking the genomic sequences g35030 on either side or between two 
or more such genomic sequences. 

g35030 cDNA Sequences 

The expression of the g35030 gene has been shown to lead to the production of several 
mRNA species. Several cDNA sequences corresponding to these mRNA are set forth in SEQ 
ID Nos 2 to 17. 

The invention encompasses a purified, isolated, or recombinant nucleic acid comprising 
a nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 1 7, 
complementary sequences thereto, splice variants thereof, as well as allelic variants, and 
fragments thereof. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant g35030 cDNAs consisting of, consisting essentially of, or comprising a 
nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 17. Particularly 
preferred nucleic acids of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 50, 
60, 70, 75, 80, 100, 200, 500 or 1000 nucleotides, to the extent that the length of said 
contiguous span is consistent with the length of the SEQ ID, of a nucleotide sequence selected 
from the group consisting of SEQ ID Nos 2 to 1 7, or the complements thereof 

It should be noted that nucleic acid fragments of any size and sequence may also be 
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comprised by the polynucleotides described in this section. 

The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 70, 80, 85, 90 or 95% nucleotide identity with a polynucleotide 
selected from the group consisting of SEQ ID Nos 2 to 17, advantageously 99% nucleotide 
identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide identity 
with a polynucleotide selected from the group consisting of SEQ ID Nos 2 to 1 7, or a sequence 
complementary thereto or a biologically active fragment thereof. 

Another object of the invention relates to purified, isolated or recombinant nucleic acids 
comprising a polynucleotide that hybridizes, under the stringent hybridization conditions 
defined herein, with a polynucleotide selected from the group consisting of SEQ ID Nos 2 to 17, 
or a sequence complementary thereto or a variant thereof or a biologically active fragment 
thereof. 

The g35030 cDN A forms of SEQ ID Nos 2 to 1 7 are further described in Table 2 
below. Shown on the Table 2 are the positions of the 5' UTR, the open reading frame (ORF), 
the 3' UTR and the polyA signal on the respective SEQ ID No. Also shown are the g3503O 
exons comprising the cDNA form of a particular SEQ ID No. 



Table 2 



cDNA 


SEQ ED 
No of 
cDNA 


• Position range 
of5'UTRin 
SEQ ID 


Position range 
of ORF in 
SEQ ED 


Position range 
of3'UTRin 
SEQ ID 


Position range 
of poly A signal 
in SEQ ID 


VV1WYZ 


2 




95 


96 


167 


168 


2226 


2200 


2205 


VVIV3WYZ 


3 




95 


96 


167 


168 


2431 


2405 


2410 


V2WYZ 


4 




95 


96 


311 


312 


2215 


2189 


2194 


VWYZ 


5 




95 


96 


212 


213 


2132 


2106 


2111 


V2YZ 


6 




95 


96 


260 


261 


1981 


1955 


1960 


W1YZ 


7 




95 


96 


167 


168 


1992 


1966 


1971 


VYZ 


8 




95 


96 


182 


183 


1898 


1872 


1877 


W1Z 


9 




95 


96 


167 


168 


1783 


1757 


1762 


V2V1WYZ 


10 




95 


96 


311 


312 


2309 


2283 


2288 


V2V1YZ 


11 




95 


96 


311 


312 


2075 


2049 


2054 


VWZ 


12 




95 


96 


212 


213 


1923 


1897 


1902 


W1V4WYZ 


13 




95 


96 


167 


168 


2308 


2282 


2287 


VV1W2YZ 


14 




95 


96 


167 


168 


2706 


2680 


2685 


VW2YZ 


15 




95 


96 


212 


213 


2612 


2586 


2591 
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STUV 


16 


1 


370 


371 


436 


437 


434 






S2TUV 


17 


1 


696 


697 


762 


763 


760 







The present invention thus further encompasses variant g35O30 polynucleotides having 
at least one nucleotide substitution, insertion or deletion as described further herein. In 
preferred embodiments described in Table 3, the invention comprises g35030 polynucleotides 
comprising a variation as described herein by biallelic markers 8-126-286 (also referred to as 8- 
130-143) and 8-155-258. The invention thus encompasses purified, isolated, or recombinant 
polynucleotides and polypeptides encoded thereby, wherein the polynucleotides comprise a 
contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 100, 150, or 200 
nucleotides of SEQ ID No 2 to 17 or the complement thereof, wherein said contiguous span 
further comprises nucleotide position 132 of any of SEQ ID Nos 2 to 1 5 and 1 7, or nucleotide 
position 407 of SEQ ID No 16. Optionally, an A is present at said nucleotide position; 
optionally a G is present at said nucleotide position. In a further embodiment, said contiguous 
span comprises nucleotide position 328 or SEQ ID 3, or the complement thereof, and optionally 
a C is present at said position, or optionally a T is present at said position. 



Table 3 



SEQID 
No 


cDNA form 


Polymorphism 
reference 


i Alleles 


Position in 
SEQID 
No 1 


Position in SEQ 
ID No ofcDNA 


Position 
in amino 
acid seq 


2 


V1WYZ 


8-126-286/ 
8-130-143 


A 


G 


216967 


132 


13 


3 


W1V3WYZ 










132 


13 


4 


V2WYZ 










132 


13 


5 


VWYZ 










132 


13 


6 


V2YZ 










132 


13 


7 


VV1YZ 










132 


13 


8 


VYZ 










132 


13 


9 


W1Z 










132 


13 


10 


V2V1 WYZ 










132 


13 


11 


V2V1YZ 










132 


13 


12 


VWZ 










132 


13 


13 


W1V4WYZ 










132 


13 


14 


W1W2YZ 










132 


13 


15 


VW2YZ 










132 


13 


16 


STUV 










407 


13 


17 


S2TUV 










733 


13 


3 


W1V3WYZ 


8-155-258 


C 


T 


238789 


W1V3WYZ:328 





While this section is entitled "g35030 cDNA Sequences," it should be noted that nucleic 
acid fragments of any size and sequence may also be comprised by the polynucleotides 
described in this section, flanking the genomic sequences of g35030 on either side or between 
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G35030 Coding Regions 

The g35030 open reading frame is contained in the corresponding mRNA of a cDNA 
sequence selected from the group consisting of SEQ ID Nos 2 to 17. The effective g35030 
coding sequence (CDS) may include several forms as indicated above, in some embodiments 
encompassing isolated, purified, and recombinant polynucleotides which encode a polypeptide 
comprising a contiguous span of at least 4 amino acids, preferably 6, more preferably at least 8 
or 10 amino acids, yet more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of 
SEQ ID Nos 1 8 to 23. The effective g35030 coding sequence (CDS) may comprise the region 
between the first nucleotide of the ATG codon and the end nucleotide of the stop codon of SEQ 
ID Nos 2 to 17 as indicated in Table 2 above. 

The above disclosed polynucleotide that contains the coding sequence of the g35030 
gene may be expressed in a desired host cell or a desired host organism when this 
polynucleotide is placed under the control of suitable expression signals. The expression 
signals may be either the expression signals contained in the regulatory regions in the g35030 
gene of the invention or in contrast the signals may be exogenous regulatory nucleic sequences. 
Such a polynucleotide, when placed under the suitable expression signals, may also be inserted 
in a vector for its expression and/or amplification. 

Regulatory Sequences Of g35030 

As mentioned, the genomic sequence of the g35030 gene contains regulatory sequences 
both in the non-coding 5'-flanking region and in the non-coding 3'-flanking region that border 
the g35030 coding region containing the exons of the gene. 

Polynucleotides derived from the 5' and 3' regulatory regions are useful in order to 
detect the presence of at least a copy of a g35030 nucleotide sequence of SEQ ID No 1 or a 
fragment thereof in a test sample. 

The promoter activity of the 5' regulatory regions contained in g35030 can be assessed 
as described below. 

In order to identify the relevant biologically active polynucleotide fragments or variants 
of a g35030 regulatory region, one of skill in the art will refer to Sambrook et al.(1989), which 
describes the use of a recombinant vector carrying a marker gene (i.e. beta galactosidase, 
chloramphenicol acetyl transferase, etc.) the expression of which will be detected when placed 
under the control of a biologically active polynucleotide fragment or variant of the g35030 
sequence of SEQ ID No 1. Genomic sequences located upstream of the first exon of the g35030 
gene are cloned into a suitable promoter reporter vector, such as the pSEAP-Basic, pSEAP- 
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Enhancer, ppgal-Basic, ppgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from 
Clontech, or pGL2-basic or pGL3-basic promoterless luciferase reporter gene vector from 
Promega. Briefly, each of these promoter reporter vectors include multiple cloning sites 
positioned upstream of a reporter gene encoding a readily assayable protein such as secreted 
alkaline phosphatase, luciferase, p galactosidase, or green fluorescent protein. The sequences 
upstream of the g35030 coding region are inserted into the cloning sites upstream of the reporter 
gene in both orientations and introduced into an appropriate host cell. The level of reporter 
protein is assayed and compared to the level obtained from a vector which lacks an insert in the 
cloning site. The presence of an elevated expression level in the vector containing the insert 
with respect to the control vector indicates the presence of a promoter in the insert. If 
necessary, the upstream sequences can be cloned into vectors which contain an enhancer for 
increasing transcription levels from weak promoter sequences. A significant level of expression 
above that observed with the vector lacking an insert indicates that a promoter sequence is 
present in the inserted upstream sequence. 

Promoter sequence within the upstream genomic DNA may be further defined by 
constructing nested 5' and/or 3' deletions in the upstream DNA using conventional techniques 
such as Exonuclease III or appropriate restriction endonuclease digestion. The resulting 
deletion fragments can be inserted into the promoter reporter vector to determine whether the 
deletion has reduced or obliterated promoter activity, such as described, for example, by Coles 
et al.(1998). In this way, the boundaries of the promoters may be defined. If desired, potential 
individual regulatory sites within the promoter may be identified using site directed mutagenesis 
or linker scanning to obliterate potential transcription factor binding sites within the promoter 
individually or in combination. The effects of these mutations on transcription levels may be 
determined by inserting the mutations into cloning sites in promoter reporter vectors. This type 
of assay is well-known to those skilled in the art and is described in WO 97/17359, US Patent 
No. 5,374,544; EP 582 796; US Patent No. 5,698,389; US 5,643,746; US Patent No. 5,502,176; 
and US Patent 5,266,488. 

The strength and the specificity of the promoter of the g35030 gene can be assessed 
through the expression levels of a detectable polynucleotide operably linked to the g35030 
promoter in different types of cells and tissues. The detectable polynucleotide may be either a 
polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a 
polynucleotide encoding a detectable protein, including a g35030 polypeptide or a fragment or a 
variant thereof. This type of assay is well-known to those skilled in the art and is described in 
US Patent No. 5,502,1 76; and US Patent No. 5,266,488. Some of the methods are discussed in 
more detail below. 
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Polynucleotides carrying the regulatory elements located at the 5' end and at the 3' end 
of the g35030 coding region may be advantageously used to control the transcriptional and 
translational activity of an heterologous polynucleotide of interest. 

Thus, the present invention also concerns a purified or isolated nucleic acid comprising 
a polynucleotide which is selected from the group consisting of the 5' and 3' regulatory regions 
of g35030, or a sequence complementary thereto or a biologically active fragment or variant 
thereof. In one aspect, "5' regulatory region 5 ' may comprise the nucleotide sequence located 
between positions 1 99 1 22 to 20 1 1 22 of SEQ ID No 1 . In one aspect, "3 ' regulatory region" 
may comprise the nucleotide sequence located between positions 247803 to 249803 of SEQ ID 
No 1. 

The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 95% nucleotide identity with a polynucleotide selected from the 
group consisting of the 5' and 3' regulatory regions, advantageously 99 % nucleotide identity, 
preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide identity with a 
polynucleotide selected from the group consisting of the 5' and 3' regulatory regions, or a 
sequence complementary thereto or a variant thereof or a biologically active fragment thereof. 

Another object of the invention consists of purified, isolated or recombinant nucleic 
acids comprising a polynucleotide that hybridizes, under the stringent hybridization conditions 
defined herein, with a polynucleotide selected from the group consisting of the nucleotide 
sequences of the 5'- and 3' regulatory regions of g35030, or a sequence complementary thereto 
or a variant thereof or a biologically active fragment thereof. 

Preferred fragments of the 5' regulatory region have a length of about 1500 or 1000 
nucleotides, preferably of about 500 nucleotides, more preferably about 400 nucleotides, even 
more preferably 300 nucleotides and most preferably about 200 nucleotides. 

Preferred fragments of the 3' regulatory region are at least 50, 100, 150, 200, 300 or 
400 bases in length. 

"Biologically active" g35030 regulatory polynucleotide derivatives of SEQ ID No 1 are 
polynucleotides comprising or alternatively consisting in a fragment of said polynucleotide 
which is functional as a regulatory region for expressing a recombinant polypeptide or a 
recombinant polynucleotide in a recombinant cell host. It could act either as an enhancer or as a 
repressor. 

For the purpose of the invention, a nucleic acid or polynucleotide is "functional" as a 
regulatory region for expressing a recombinant polypeptide or a recombinant polynucleotide if 
said regulatory polynucleotide contains nucleotide sequences which contain transcriptional and 
translational regulatory information, and such sequences are "operably linked" to nucleotide 
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sequences which encode the desired polypeptide or the desired polynucleotide. 

The regulatory polynucleotides of the invention may be prepared from the nucleotide 
sequence of SEQ ID No 1 by cleavage using suitable restriction enzymes, as described for 
example in Sambrook et al.(1989). The regulatory polynucleotides may also be prepared by 
digestion of SEQ ID No 1 by an exonuclease enzyme, such as BaI31 (Wabiko et al., 1986). 
These regulatory polynucleotides can also be prepared by nucleic acid chemical synthesis, as 
described elsewhere in the specification. 

The g35030 regulatory polynucleotides according to the invention may be part of a 
recombinant expression vector that may be used to express a coding sequence in a desired host 
cell or host organism. The recombinant expression vectors according to the invention are 
described elsewhere in the specification. 

A preferred 5'-regulatory polynucleotide of the invention includes the 5 '-untranslated 
region (5'-UTR) of the g35030 cDNA, or a biologically active fragment or variant thereof. 

A preferred 3 '-regulatory polynucleotide of the invention includes the 3 '-untranslated 
region (3'-UTR) of the g35030 cDNA, or a biologically active fragment or variant thereof. 

A further object of the invention consists of a purified or isolated nucleic acid 
comprising: 

a) a nucleic acid comprising a regulatory nucleotide sequence selected from the group 
consisting of: 

(i) a nucleotide sequence comprising a polynucleotide of the g35030 5' regulatory region or a 
complementary sequence thereto; 

(ii) a nucleotide sequence comprising a polynucleotide having at least 95% of nucleotide 
identity with the nucleotide sequence of the g35030 5' regulatory region or a complementary 
sequence thereto; 

(iii) a nucleotide sequence comprising a polynucleotide that hybridizes under stringent 
hybridization conditions with the nucleotide sequence of the g35030 5' regulatory region or a 
complementary sequence thereto; and 

(iv) a biologically active fragment or variant of the polynucleotides in (i), (ii) and (iii); 

b) a polynucleotide encoding a desired polypeptide or a nucleic acid of interest, 
operably linked to the nucleic acid defined in (a) above; and 

c) optionally, a nucleic acid comprising a 3'- regulatory polynucleotide, preferably a 3'- 
regulatory polynucleotide of the g35030 gene. 

In a specific embodiment of the nucleic acid defined above, said nucleic acid includes 
the S'-untranslated region (5'-UTR) of the g35030 cDNA, or a biologically active fragment or 
variant thereof. 
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In a second specific embodiment of the nucleic acid defined above, said nucleic acid 
includes the 3 '-untranslated region (3'-UTR) of the g35030 cDNA, or a biologically active 
fragment or variant thereof. 

The regulatory polynucleotide of the 5' regulatory region, or its biologically active 
fragments or variants, is operably linked at the 5 '-end of the polynucleotide encoding the 
desired polypeptide or polynucleotide. 

The regulatory polynucleotide of the 3' regulatory region, or its biologically active 
fragments or variants, is advantageously operably linked at the 3'-end of the polynucleotide 
encoding the desired polypeptide or polynucleotide. 

The desired polypeptide encoded by the above-described nucleic acid may be of various 
nature or origin, encompassing proteins of prokaryotic or eukaryotic origin. Among the 
polypeptides expressed under the control of a g35030 regulatory region include bacterial, fungal 
or viral antigens. Also encompassed are eukaryotic proteins such as intracellular proteins, like 
"house keeping" proteins, membrane-bound proteins, like receptors, and secreted proteins like 
endogenous mediators such as cytokines. The desired polypeptide may be the g35030 protein, 
especially the protein of the amino acid sequences of SEQ ID Nos 18 to 23, or a fragment or a 
variant thereof. 

The desired nucleic acids encoded by the above-described polynucleotide, usually an 
RNA molecule, may be complementary to a desired coding polynucleotide, for example to the 
g35030 coding sequence, and thus useful as an antisense polynucleotide. 

Such a polynucleotide may be included in a recombinant expression vector in order to 
express the desired polypeptide or the desired nucleic acid in host cell or in a host organism. 
Suitable recombinant vectors that contain a polynucleotide such as described herein are 
disclosed elsewhere in the specification. 



Polynucleotide Constructs 

The terms "polynucle otide construct " and " recombinant polynucleotide " are used 
interchangeably herein to refer to linear or circular, purified or isolated polynucleotides that 
have been artificially designed and which comprise at least two nucleotide sequences that are 
not found as contiguous nucleotide sequences in their initial natural environment. It should be 
noted that the present invention embodies recombinant vectors comprising any one of the 
polynucleotides described in the present invention. 

DNA Constructs that Enables Directing Temporal and Spatial Expression of 
g35030 Nucleic Acid Sequences in Recombinant Cell Hosts and in Transgenic Animals 
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In order to study the physiological and phenotypic consequences of a lack of synthesis 
of a protein encoded by a nucleotide sequence comprising a g35030 polynucleotide, both at the 
cell level and at the multi cellular organism level, the invention also encompasses DNA 
constructs and recombinant vectors enabling a conditional expression of a specific allele of a 
nucleotide sequence comprising a g35030 polynucleotide and also of a copy of a sequence 
comprising a nucleotide sequence of a g35030 polynucleotide, or a fragment thereof, harboring 
substitutions, deletions, or additions of one or more bases. These base substitutions, deletions 
or additions may be located either in an exon, an intron or a regulatory sequence, in particular a 
5' regulatory sequence of a g35030 polynucleotide. In a preferred embodiment, the nucleotide 
sequence comprising a g35030 polynucleotide further comprises a biallelic marker of the 
present invention. 

A first preferred DNA construct is based on the tetracycline resistance operon tet from 
E. coli transposon Tnl 10 for controlling the expression of a g35030 polynucleotide, such as 
described by Gossen et al. ( 1 992, 1 995) and Furth et al.( 1 994). Such a DNA construct contains 
seven tet operator sequences from Tn 1 0 (tetop) that are fused to either a minimal promoter or a 
S'-reguIatory sequence of the g35030 polynucleotide, said minimal promoter or said g35030 
polynucleotide regulatory sequence being operably linked to a polynucleotide of interest that 
codes either for a sense or an antisense oligonucleotide or for a polypeptide, including a g35030 
polynucleotide-encoded polypeptide or a peptide fragment thereof. This DNA construct is 
functional as a conditional expression system for the nucleotide sequence of interest when the 
same cell also comprises a nucleotide sequence coding for either the wild type (tTA) or the 
mutant (rTA) repressor fused to the activating domain of viral protein VP 16 of herpes simplex 
virus, placed under the control of a promoter, such as the HCMVIEI enhancer/promoter or the 
MMTV-LTR. Indeed, a preferred DNA construct of the invention comprises both the 
polynucleotide containing the (et operator sequences and the polynucleotide containing a 
sequence coding for the tTA or the rTA repressor. 

In a specific embodiment, the conditional expression DNA construct contains the 
sequence encoding the mutant tetracycline repressor rTA, the expression of the polynucleotide 
of interest is silent in the absence of tetracycline and induced in its presence. 

DNA Constructs Allowing Homologous Recombination: Replacement Vectors 
A second preferred DNA construct will comprise, from 5'-end to 3'-end: (a) a first 
nucleotide sequence comprising a g35030 polynucleotide; (b) a nucleotide sequence 
comprising a positive selection marker, such as the marker for neomycine resistance (neo); and 
(c) a second nucleotide sequence comprising a respective g35030 polynucleotide, and is located 
on the genome downstream of the first g35030 polynucleotide sequence (a). 
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In a preferred embodiment, this DNA construct also comprises a negative selection 
marker located upstream the nucleotide sequence (a) or downstream the nucleotide sequence 
(c). Preferably, the negative selection marker comprises the thymidine kinase (tk) gene 
(Thomas et al., 1986), the hygromycine beta gene (Te Riele et al., 1 990), the hprt gene ( Van 
derLugtetal., 1991; Reidetal, 1990) or the Diphteria toxin A fragment [DhA) gene (Nada et 
al., 1993; Yagi et al.1990). Preferably, the positive selection marker is located within and exon 
of a g35030 polynucleotide so as to interrupt the sequence encoding the g35030 protein. These 
replacement vectors are described, for example, by Thomas et al.( 1 986; 1 987), Mansour et 
al.(1988) and Koller et al.(1992). 

The first and second nucleotide sequences (a) and (c) may be indifferently located 
within a g35030 polynucleotide regulatory sequence, an intronic sequence, an exon sequence or 
a sequence containing both regulatory and/or intronic and/or exon sequences. The size of the 
nucleotide sequence of (a) and (c) ranges from I to 50 kb, preferably from 1 to 10 kb, more 
preferably from 2 to 6 kb and most preferably from 2 to 4 kb. 

DNA Constructs Allowing Homologous Recombination: Cre-LoxP System. 

These new DNA constructs make use of the site specific recombination system of the 
PI phage. The PI phage possesses a recombinase called Cre which interacts specifically with a 
34 base pairs loxP site. The loxP site is composed of two palindromic sequences of 13 bp 
separated by a 8 bp conserved sequence (Hoess et al., 1986). The recombination by the Cre 
enzyme between two loxP sites having an identical orientation leads to the deletion of the DNA 
fragment. 

The Cre-/atP system used in combination with a homologous recombination technique 
has been first described by Gu et aL(1993, 1994). Briefly, a nucleotide sequence of interest to 
be inserted in a targeted location of the genome harbors at least two lox? sites in the same 
orientation and located at the respective ends of a nucleotide sequence to be excised from the 
recombinant genome. The excision event requires the presence of the recombinase (Cre) 
enzyme within the nucleus of the recombinant cell host. The recombinase enzyme may be 
brought at the desired time either by (a) incubating the recombinant cell hosts in a culture 
medium containing this enzyme, by injecting the Cre enzyme directly into the desired cell, such 
as described by Araki et al.(1995), or by lipofection of the enzyme into the cells, such as 
described by Baubonis et al.(1993); (b) transfecting the cell host with a vector comprising the 
Cre coding sequence operably linked to a promoter functional in the recombinant cell host, 
which promoter being optionally inducible, said vector being introduced in the recombinant cell 
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host, such as described by Gu et al.(1993) and Sauer et al.(1988); (c) introducing in the genome 
of the cell host a polynucleotide comprising the Cre coding sequence operably linked to a 
promoter functional in the recombinant cell host, which promoter is optionally inducible, and 
said polynucleotide being inserted in the genome of the cell host either by a random insertion 
event or an homologous recombination event, such as described by Gu et al.(I994). 

In a specific embodiment, the vector containing the sequence to be inserted in a g35030 
gene sequence by homologous recombination is constructed in such a way that selectable 
markers are flanked by lox? sites of the same orientation, it is possible, by treatment by the Cre 
enzyme, to eliminate the selectable markers while leaving the g35030 polynucleotide sequences 
of interest that have been inserted by an homologous recombination event. Again, two 
selectable markers are needed: a positive selection marker to select for the recombination event 
and a negative selection marker to select for the homologous recombination event. Vectors and 
methods using the Cre-tecP system are described by Zou et al.(1994). 

Thus, in one aspect, a further preferred DNA construct of the invention comprises, from 
5'-end to 3'-end: (a) a first nucleotide sequence that is comprised by a g35030 polynucleotide; 
(b) a nucleotide sequence comprising a polynucleotide encoding a positive selection marker, 
said nucleotide sequence comprising additionally two sequences defining a site recognized by a 
recombinase, such as a lox? site, the two sites being placed in the same orientation; and (c) a 
second nucleotide sequence comprising a g35030 polynucleotide, and is located on the genome 
downstream of the first g35030 polynucleotide sequence (a). 

The sequences defining a site recognized by a recombinase, such as a lox? site, are 
preferably located within the nucleotide sequence (b) at suitable locations bordering the 
nucleotide sequence for which the conditional excision is sought. In one specific embodiment, 
two lox? sites are located at each side of the positive selection marker sequence, in order to 
allow its excision at a desired time after the occurrence of the homologous recombination event. 

In a preferred embodiment of a method using the third DNA construct described above, 
the excision of the polynucleotide fragment bordered by the two sites recognized by a 
recombinase, preferably two lox? sites, is performed at a desired time, due to the presence 
within the genome of the recombinant host cell of a sequence encoding the Cre enzyme 
operably linked to a promoter sequence, preferably an inducible promoter, more preferably a 
tissue-specific promoter sequence and most preferably a promoter sequence which is both 
inducible and tissue-specific, such as described by Gu et al.(1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may 
result from the breeding of two transgenic animals, the first transgenic animal bearing the 
g35030 polynucleotide -derived sequence of interest containing the lox? sites as described 
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above and the second transgenic animal bearing the Cre coding sequence operably linked to a 
suitable promoter sequence, such as described by Gu et al.(1994). 

Spatio-temporal control of the Cre enzyme expression may also be achieved with an 
adenovirus based vector that contains the Cre gene thus allowing infection of cells, or in vivo 
infection of organs, for delivery of the Cre enzyme, such as described by Anton and Graham 
(1995) and Kanegae et al.(1995). 

The DNA constructs described above may be used to introduce a desired nucleotide 
sequence of the invention, preferably a g35030 polynucleotide, and most preferably an altered 
copy of a g35030 polynucleotide sequence, within a predetermined location of the targeted 
genome, leading either to the generation of an altered copy of a targeted gene (knock-out 
homologous recombination) or to the replacement of a copy of the targeted gene by another 
copy sufficiently homologous to allow an homologous recombination event to occur (knock-in 
homologous recombination). In a specific embodiment, the DNA constructs described above 
may be used to introduce a g35030 polynucleotide. 
Nuclear Antisense DNA Constructs 

Other compositions containing a vector of the invention comprise an oligonucleotide 
fragment of the g35030 polynucleotide sequences of SEQ ID No.l respectively, as an antisense 
tool that inhibits the expression of the corresponding gene. Preferred methods using antisense 
polynucleotide according to the present invention are the procedures described by Sczakiel et 
al.(1995) or those described in PCT Application No WO 95/24223. 

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) 
that are complementary to the 5*end of a g3503O polynucleotide mRNA. In one embodiment, a 
combination of different antisense polynucleotides complementary to different parts of the 
desired targeted gene are used. 

Preferably, the antisense polynucleotides of the invention have a 3' polyadenylation 
signal that has been replaced with a self-cleaving ribozyme sequence, such that RNA 
polymerase II transcripts are produced without poly(A) at their 3' ends, these antisense 
polynucleotides being incapable of export from the nucleus, such as described by Liu et 
al.(1994). In a preferred embodiment, these g35030 antisense polynucleotides also comprise, 
within the ribozyme cassette, a histone stem-loop structure to stabilize cleaved transcripts 
against 3-5' exonucleolytic degradation, such as the structure described by Eckner et al.(1991) ; 
Constructs for Activation of g35030 gene 

The present invention also encompasses primary, secondary, and immortalized 
homologously recombinant host cells of vertebrate origin, preferably mammalian origin and 
particularly human origin, that have been engineered to: a) insert exogenous (heterologous) 
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polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete 
endogenous chromosomal DNA, and/or c) replace endogenous chromosomal DNA with 
exogenous polynucleotides. Insertions, deletions, and/or replacements of polynucleotide 
sequences may be to the coding sequences of the targeted gene and/or to regulatory regions, 
such as promoter and enhancer sequences, operably associated with the targeted gene. 

The present invention further relates to a method of making a homologously 
recombinant host cell in vitro or in vivo, wherein the expression of a targeted gene not normally 
expressed in the cell is altered. Preferably the alteration causes expression of the targeted gene 
under normal growth conditions or under conditions suitable for producing the polypeptide 
encoded by the targeted gene. The method comprises the steps of: (a) transfecting the cell in 
vitro or in vivo with a polynucleotide construct, the a polynucleotide construct comprising; (i) a 
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired 
splice donor site, if necessary, thereby producing a transfected cell; and (b) maintaining the 
transfected cell in vitro or in vivo under conditions appropriate for homologous recombination. 

The present invention further relates to a method of altering the expression of a targeted 
gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, 
comprising the steps of: (a) transfecting the cell in vitro or in vivo with a a polynucleotide 
construct, the a polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory 
sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby 
producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under 
conditions appropriate for homologous recombination, thereby producing a homologously 
recombinant cell; and (c) maintaining the homologously recombinant cell in vitro or in vivo 
under conditions appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present 
invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo 
wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting 
the cell in vitro with a a polynucleotide construct, the a polynucleotide construct comprising: (i) 
a targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired 
splice donor site, if necessary, thereby producing a transfected cell; (b) maintaining the 
transfected cell in vitro or in vivo under conditions appropriate for homologous recombination, 
thereby producing a homologously recombinant cell; and c) maintaining the homologously 
recombinant cell in vitro or in vivo under conditions appropriate for expression of the gene 
thereby making the polypeptide. 

The present invention further relates to a a polynucleotide construct which alters the 
expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
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occurs when the a polynucleotide construct is inserted into the chromosomal DNA of the target 
cell, wherein the a polynucleotide construct comprises: a) a targeting sequence; b) a regulatory 
sequence and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Further 
included are a polynucleotide constructs, as described above, wherein the contruct ftirther 
comprises A a polynucleotide which encodes a polypeptide and is in-frame with the targeted 
endogenous gene after homologous recombination with chromosomal DNA. 

The compositions may be produced, and methods performed, by techniques known in 
the art, such as those described in U.S. Patent Nos: 6,054,288; 6,048,729; 6,048,724; 6,048,524; 
5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 
5,580,734 ; International Publication Nos:W096/2941 1, WO 94/12650; and scientific articles 
including 1994; Koller et al M Proc. Natl. Acad. Sci. USA 86:8932-8935 (1989) (the disclosures 
of each of which are incorporated by reference in their entireties). 

Oligonucleotide Probes And Primers 

The polynucleotides of the invention are useful in order to detect the presence of at least 
a copy of a g35030 nucleotide sequence of SEQ ID Nos 1 to 1 7 or of the respective g35030 
polynucleotide or gene, or a fragment, complement, or variant thereof in a test sample. 

Particularly preferred probes and primers of the invention include isolated, purified, or 
recombinant polynucleotides comprising a contiguous span of at least 12, 1 5, 18, 20, 25, 30, 35, 
40, 50, 60, 70, 80, 90, 1 00, 1 50, 200, 500, 1 000 or 2000 nucleotides, to the extent that said span 
is consistent with the length of the nucleotide position range, of SEQ ID No 1, wherein said 
contiguous span comprises at least 1, 2, 3, 4, 5, 7 or 10 of the following nucleotide positions of 
SEQ ID No 1: 201 123 to 201 234, 201123 to 20 1 560, 2 1 4676 to 2 1 4793, 2 1 5702 to 2 1 5746, 
216836 to 216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 
238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 
to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803, or a 
complementary sequence thereto or a fragment thereof. 

Probes and primers of the invention also include isolated, purified, or recombinant 
polynucleotides having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with a contiguous 
span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 
2000 nucleotides of nucleotide positions 201 123 to 201234, 201 123 to 201560, 214676 to 
2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 767 1 to 2 1 7764, 227655 
to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 
244353 to 244561, 246273 to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 
and 199122 to 249803 of SEQ ID No. 1. Preferred probes and primers of the invention also 
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include isolated, purified, or recombinant polynucleotides comprising a g35030 nucleotide 
sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with at least one 
sequence selected from the group consisting of the following nucleotide positions of SEQ ID 
No. 1: 201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 
to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 
201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803; or a 
complementary sequence thereto or a fragment thereof. 

The invention also relates to nucleic acid probes characterized in that they hybridize 
specifically, under the stringent hybridization conditions defined above, with a contiguous span 
of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 2000 
nucleotides of nucleotide positions 201 123 to 201234, 201 123 to 201560, 214676 to 214793, 
2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 7671 to 2 1 7764, 227655 to 
227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 
to 244561 , 246273 to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 
199122 to 249803 of SEQ ID No. 1, or a variant thereof or a sequence complementary thereto. 

Probes and primers of the invention further include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90, 1 00, 150, 200, 500, 1000 or 2000 nucleotides, to the extent that said span is 
consistent with the length of the nucleotide position range, of SEQ ID No 2 to 17 or 79 to 132. 
Also included are isolated, purified, or recombinant polynucleotides having at least 70, 75, 80, 
85, 90, or 95% nucleotide identity with a contiguous span of at least 12, 1 5, 18, 20, 25, 30, 35, 
40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 2000 nucleotides of SEQ ID No. 2 to 17 or 
79 to 132, or a complementary sequence thereto, as well as isolated, purified, or recombinant 
polynucleotides comprising a g35030 nucleotide sequence having at least 70, 75, 80, 85, 90, or 
95% nucleotide identity with at least one sequence selected from the group consisting of the 
following nucleotide positions of SEQ ID No. 2 to 1 7 or 79 to 1 32; or a complementary 
sequence thereto or a fragment thereof. The invention also relates to nucleic acid probes 
characterized in that they hybridize specifically, under the stringent hybridization conditions 
defined above, with a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 
90, 1 00, 1 50, 200, 500, 1 000 or 2000 nucleotides of nucleotide SEQ ID No. 2 to 1 7 or 79 to 
1 32, or a variant thereof or a sequence complementary thereto. 

The formation of stable hybrids depends on the melting temperature (Tm) of the DNA. 
The Tm depends on the length of the primer or probe, the ionic strength of the solution and the 
G+C content. The higher the G+C content of the primer or probe, the higher is the melting 
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temperature because G:C pairs are held by three H bonds whereas A:T pairs have only two. 
The GC content in the probes of the invention usually ranges between 10 and 75 %, preferably 
between 35 and 60 %, and more preferably between 40 and 55 %. 

A probe or a primer according to the invention may be between 8 and 2000 nucleotides 
in length, or is specified to be at least 12, 15, 18, 20, 25, 35, 40, 50, 60, 70, 80, 100, 250, 500 , 
1000 nucleotides in length. More particularly, the length of these probes can range from 8, 10, 
15, 20, or 30 to 100 nucleotides, preferably from 10 to 50, more preferably from 15 to 30 
nucleotides. Shorter probes tend to lack specificity for a target nucleic acid sequence and 
generally require cooler temperatures to form sufficiently stable hybrid complexes with the 
template. Longer probes are expensive to produce and can sometimes self-hybridize to form 
hairpin structures. The appropriate length for primers and probes under a particular set of assay 
conditions may be empirically determined by one of skill in the art. 

The primers and probes can be prepared by any suitable method, including, for 
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a 
method such as the phosphodiester method of Narang et al.(1979), the phosphodiester method 
of Brown et al.(1979), the diethylphosphoramidite method of Beaucage et al.(1981) and the 
solid support method described in EP 0 707 592. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid 
analogs such as, for example peptide nucleic acids which are disclosed in International Patent 
Application WO 92/20702, morpholino analogs which are described in U.S. Patents Numbered 
5,185,444; 5,034,506 and 5,142,047. The probe may have to be rendered "non-extendable" in 
that additional dNTPs cannot be added to the probe. In and of themselves analogs usually are 
non-extendable and nucleic acid probes can be rendered non-extendable by modifying the 3' end 
of the probe such that the hydroxyl group is no longer capable of participating in elongation. 
For example, the 3* end of the probe can be functionalized with the capture or detection label to 
thereby consume or otherwise block the hydroxyl group. Alternatively, the 3* hydroxyl group 
simply can be cleaved, replaced or modified; U.S. Patent Application Serial No. 07/049,061 
filed April 19, 1993, describes modifications which can be used to render a probe non- 
extendable. 

Any of the polynucleotides of the present invention can be labeled, if desired, by 
incorporating a label detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include radioactive 

32 35 3 125 

substances ( P, S, H, I), fluorescent dyes (5-bromodesoxyuridin, fluorescein, 
acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at their 3' 
and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described in 
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the French patent No. FR-78 10975 or by Urdea et al (1988) or Sanchez-Pescador et al (1988). 
In addition, the probes according to the present invention may have structural characteristics 
such that they allow the signal amplification, such structural characteristics being, for example, 
branched DNA probes as those described by Urdea et al. in 1991 or in the European patent No. 
EP 0 225 807 (Chiron). 

A label can also be used to capture the primer, so as to facilitate the immobilization of 
either the primer or.a primer extension product, such as amplified DNA, on a solid support. A 
capture label is attached to the primers or probes and can be a specific binding member which 
forms a binding pair with the solid's phase reagent's specific binding member (e.g. biotin and 
streptavidin). Therefore depending upon the type of label carried by a polynucleotide or a 
probe, it may be employed to capture or to detect the target DNA. Further, it will be understood 
that the polynucleotides, primers or probes provided herein, may, themselves, serve as the 
capture label. For example, in the case where a solid phase reagent's binding member is a 
nucleic acid sequence, it may be selected such that it binds a complementary portion of a primer 
or probe to thereby immobilize the primer or probe to the solid phase. In cases where a 
polynucleotide probe itself serves as the binding member, those skilled in the art will recognize 
that the probe will contain a sequence or "tail" that is not complementary to the target. In the 
case where a polynucleotide primer itself serves as the capture label, at least a portion of the 
primer will be free to hybridize with a nucleic acid on a solid phase. DNA Labeling techniques 
are well known to the skilled technician. 

The probes of the present invention are useful for a number of purposes. They can be 
notably used in Southern hybridization to genomic DNA. The probes can also be used to detect 
PCR amplification products. They may also be used to detect mismatches in a sequence 
comprising a polynucleotide of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132, or a g35030 
polynucleotide or gene or mRNA using other techniques. 

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. Solid supports are known to those skilled in the 
art and include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic 
beads, nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other 
animal) red blood cells, duracytes and others. The solid support is not critical and can be 
selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and duracytes are all suitable examples. 
Suitable methods for immobilizing nucleic acids on solid phases include ionic, hydrophobic, 
covalent interactions and the like. A solid support, as used herein, refers to any material which 



WO 01/40493 

PCT/IBOO/01507 

44 

is insoluble, or can be made insoluble by a subsequent reaction. The solid support can be 
chosen for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely 
charged with respect to the capture reagent itself or to a charged substance conjugated to the 
capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid support and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid support material before the 
performance of the assay or during the performance of the assay. The solid phase thus can be a 
plastic, derivatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
tube, microliter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, duracytes and other configurations known to those of ordinary skill in the art. The 
polynucleotides of the invention can be attached to or immobilized on a solid support 
individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the 
invention to a single solid support. In addition, polynucleotides other than those of the 
invention may be attached to the same solid support as one or more polynucleotides of the 
invention. 

Consequently, the invention also comprises a method for detecting the presence of a 
nucleic acid comprising a nucleotide sequence selected from a group consisting of SEQ ID Nos. 
1 to 17, 24, 25, 28, 74 and 79 to 132, a fragment or a variant thereof or a complementary 
sequence thereto in a sample, said method comprising the following steps of: 

a) bringing into contact a nucleic acid probe or a plurality of nucleic acid probes which 
can hybridize with a nucleotide sequence included in a nucleic acid selected form the group 
consisting of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132, a 
fragment or a variant thereof or a complementary sequence thereto and the sample to be 
assayed; and 

b) detecting the hybrid complex formed between the probe and a nucleic acid in the 

sample. 

The invention further concerns a kit for detecting the presence of a nucleic acid 
comprising a nucleotide sequence selected from a group consisting of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 132, a fragment or a variant thereof or a complementary sequence thereto 
in a sample, said kit comprising: 

a) a nucleic acid probe or a plurality of nucleic acid probes which can hybridize with a 
nucleotide sequence included in a nucleic acid selected form the group consisting of the 
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nucleotide sequences of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132, a fragment or a 
variant thereof or a complementary sequence thereto; and 

b) optionally, the reagents necessary for performing the hybridization reaction. 
In a first preferred embodiment of this detection method and kit, said nucleic acid probe 
or the plurality of nucleic acid probes are labeled with a detectable molecule. In a second 
preferred embodiment of said method and kit, said nucleic acid probe or the plurality of nucleic 
acid probes has been immobilized on a substrate. In a third preferred embodiment, the nucleic 
acid probe or the plurality of nucleic acid probes comprise either a sequence which is selected 
from the group consisting of the nucleotide sequences of PI to PI 27 and the complementary 
sequence thereto, Bl to Bl 14, CI to CI 14, Dl to D127, El to E127, or a nucleotide sequence 
comprising a biallelic marker selected from the group consisting of A I to A127, or the 
complements thereto. 

Oligonucleotide Arrays 

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 
may be used either for detecting or amplifying targeted sequences in a nucleotide sequence of 
SEQ ID No. 1, more particularly in a g35030 polynucleotide, or in genes comprising a g35030 
polynucleotide and may also be used for detecting mutations in the coding or in the non-coding 
sequences of a g35030 nucleic acid sequence, or genes comprising a g35030 nucleic acid 
sequence. 

Any polynucleotide provided herein may be attached in overlapping areas or at random 
locations on the solid support. Alternatively the polynucleotides of the invention may be 
attached in an ordered array wherein each polynucleotide is attached to a distinct region of the 
solid support which does not overlap with the attachment site of any other polynucleotide. 
Preferably, such an ordered array of polynucleotides is designed to be "addressable" where the 
distinct locations are recorded and can be accessed as part of an assay procedure. Addressable 
polynucleotide arrays typically comprise a plurality of different oligonucleotide probes that are 
coupled to a surface of a substrate in different known locations. The knowledge of the precise 
location of each polynucleotides location makes these "addressable" arrays particularly useful 
in hybridization assays. Any addressable array technology known in the art can be employed 
with the polynucleotides of the invention. One particular embodiment of these polynucleotide 
arrays is known as Genechips™, and has been generally described in US Patent 5,143,854; 
PCT publications WO 90/15070 and 92/10092. These arrays may generally be produced using 
mechanical synthesis methods or light directed synthesis methods which incorporate a 
combination of photolithographic methods and solid phase oligonucleotide synthesis (Fodor et 
a!., 1991, incorporated herein by reference). The immobilization of arrays of oligonucleotides 
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on solid supports has been rendered possible by the development of a technology generally 
identified as "Very Large Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, 
typically, probes are immobilized in a high density array on a solid surface of a chip. Examples 
of VLSIPS™ technologies are provided in US Patents 5,143,854; and 5,412,087 and in PCT 
Publications WO 90/15070, WO 92/10092 and WO 95/1 1995, which describe methods for 
forming oligonucleotide arrays through techniques such as light-directed synthesis techniques. 
In designing strategies aimed at providing arrays of nucleotides immobilized on solid supports, 
further presentation strategies were developed to order and display the oligonucleotide arrays on 
the chips in an attempt to maximize hybridization patterns and sequence information. Examples 
of such presentation strategies are disclosed in PCT Publications WO 94/12305, WO 94/1 1530, 
WO 97/29212 and WO 97/31256. 

In another embodiment of the oligonucleotide arrays of the invention, an 
oligonucleotide probe matrix may advantageously be used to detect mutations occurring in a 
g35030 polynucleotide, including in genes comprising a g35030 polynucleotide and preferably 
in a g35030 polynucleotide regulatory region. For this particular purpose, probes are 
specifically designed to have a nucleotide sequence allowing their hybridization to the genes 
that carry known mutations (either by deletion, insertion or substitution of one or several 
nucleotides). By known mutations in a g35030 polynucleotide, it is meant, mutations in a 
g35030 polynucleotide that have been identified according; the technique used by Huang et 
al.(1996) or Samson et al.(1996), for example, may be used to identify such mutations. 

Another technique that is used to detect mutations in a g35030 polynucleotide is the use 
of a high-density DNA array. Each oligonucleotide probe constituting a unit element of the 
high density DNA array is designed to match a specific subsequence of a g35030 
polynucleotide. Thus, an array consisting of oligonucleotides complementary to subsequences 
of the target gene sequence is used to determine the identity of the target sequence with the 
wild-type gene sequence, measure its amount, and detect differences between the target 
sequence and the reference wild-type nucleic acid sequence of a g35030 polynucleotide. In one 
such design, termed 4L tiled array, is implemented a set of four probes (A, C, G, T), preferably 
1 5-nucleotide oligomers. In each set of four probes, the perfect complement will hybridize 
more strongly than mismatched probes. Consequently, a nucleic acid target of length L is 
scanned for mutations with a tiled array containing 4L probes, the whole probe set containing 
all the possible mutations in the known wild reference sequence. The hybridization signals of 
the 15-mer probe set tiled array are perturbed by a single base change in the target sequence. 
As a consequence, there is a characteristic loss of signal or a "footprint" for the probes flanking 
a mutation position. This technique was described by Chee et al. in 1996. 
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Consequently, the invention concerns an array of nucleic acid molecules comprising at 
least one polynucleotide described above as probes and primers. Preferably, the invention 
concerns an array of nucleic acid comprising at least two polynucleotides described above as 
probes and primers. 

G3S030 Proteins and Polypeptide Fragments: 

The terms " g35030 polypeptides ", are used herein to embrace all of the proteins and 
polypeptides encoded by the respective g35030 polypeptides of the present invention. Forming 
part of the invention are polypeptides encoded by the polynucleotides of the invention, as well 
as fusion polypeptides comprising such polypeptides. The invention embodies proteins from 
humans, mammals, primates, non-human primates, and includes isolated or purified g35030 
proteins consisting, consisting essentially, or comprising the sequence of SEQ ID Nos 1 8 to 23. 

It should be noted that the g35030 proteins of the invention also comprise naturally, 
occurring variants of the amino acid sequence of the respective human g35030 proteins. 

The present invention embodies isolated, purified, and recombinant polypeptides 
comprising a contiguous span of at least 4 amino acids, preferably at least 6, more preferably at 
least 8 to 10 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 95 
or 1 00 amino acids, to the extent that said span is consistent with the length of a particular SEQ • 
ID, of SEQ ID Nos 18 to 23. In other preferred embodiments the contiguous stretch of amino 
acids comprises the site of a mutation or functional mutation, including a deletion, addition, 
swap or truncation of the amino acids in a g35030 protein sequence. In one aspect, the 
contiguous stretch of amino acids comprises deletion, addition, swap or truncation at amino acid 
position 13 of SEQ ID Nos 18 to 23, corresponding to polymorphism described herein by 
biallelic markers 8-126-286 (also referred to as 8-130-143) and 8-155-258. 

The invention further embodies g35030 polypeptides, including isolated and 
recombinant polypeptides, encoded respectively by g35030 polynucleotides consisting, 
consisting essentially, or comprising a contiguous span of at least 12, 1 5, 1 8, 20, 25, 30, 35, 40, 
50, 60, 70, 80, 90, 1 00, 1 50, 200 or 500 nucleotides of SEQ ID Nos 2 to 1 7. 

The invention further embodies g35030 polypeptides, including isolated and 
recombinant polypeptides, encoded respectively by g35030 polynucleotides consisting, 
consisting essentially, or comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 
50, 60, 70, 80, 90, 1 00, 1 50, 200 or 500 nucleotides, to the extent that the length of said span is 
consistent with the nucleotide position range, of SEQ ID No 1, wherein said contiguous span 
comprises at least 1 , 2, 3, 4, 5, 7 or 1 0 of the following nucleotide positions of SEQ ID No 1 : 
201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
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216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 
to 240673, 240440 to 241 1 53, 241072 to 241291, 244353 to 244561 and 246273 to 247802, or 
the complements thereof. 

g35030 proteins are preferably isolated from human or mammalian tissue samples or 
expressed from human or mammalian genes. The g35030 polypeptides of the invention can be 
made using routine expression methods known in the art. The polynucleotide encoding the 
desired polypeptide, is ligated into an expression vector suitable for any convenient host. Both 
eukaryotic and prokaryotic host systems is used in forming recombinant polypeptides, and a 
summary of some of the more common systems. The polypeptide is then isolated from lysed 
cells or from the culture medium and purified to the extent needed for its intended use. 
Purification is by any technique known in the art, for example, differential extraction, salt 
fractionation, chromatography, centrifiigation, and the like. See, for example, Methods in 
Enzymology for a variety of methods for purifying proteins. 

In addition, shorter protein fragments can be produced by chemical synthesis. 
Alternatively the proteins of the invention is extracted from cells or tissues of humans or non- 
human animals. Methods for purifying proteins are known in the art, and include the use of 
detergents or chaotropic agents to disrupt particles followed by differential extraction and 
separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
sedimentation according to density, and gel electrophoresis. 

Any g35030 cDNA or fragment thereof, including the respective cDNA sequences of SEQ 
ID Nos 2 to 17, is used to express g35030 proteins and polypeptides. The nucleic acid encoding 
the g35030 protein or polypeptide to be expressed is operably linked to a promoter in an expression . 
vector using conventional cloning technology. The g35030 insert in the expression vector may 
comprise the full coding sequence for the respective g35O30 protein or a portion thereof. For 
example, the g35030 derived insert may encode a polypeptide comprising at least 1 0 consecutive 
amino acids of the respective g35030 protein of SEQ ID Nos 1 8 to 23. 

The expression vector is any of the mammalian, yeast, insect or bacterial expression 
systems known in the art. Commercially available vectors and expression systems are available 
from a variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to 
enhance expression and facilitate proper protein folding, the codon context and codon pairing of the 
sequence is optimized for the particular expression organism in which the expression vector is 
introduced, as explained by Hatfield, et al., U.S. Patent No. 5,082,767. 

In one embodiment, the entire coding sequence of the g35030 cDNA through the poly A 
signal of the cDNA are operably linked to a promoter in the expression vector. Alternatively, if the 
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nucleic acid encoding a portion of the g35030 protein lacks a methionine to serve as the initiation 
site, an initiating methionine can be introduced next to the first codon of the nucleic acid using 
conventional techniques. Similarly, if the insert from the g35030 cDNA lacks a poly A signal, this 
sequence can be added to the construct by, for example, splicing out the Poly A signal from pSG5 
(Stratagene) using Bgll and Sail restriction endonuclease enzymes and incorporating it into the 
mammalian expression vector pXTl (Stratagene). pXTI contains the LTRs and a portion of the 
gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in the construct allow 
efficient stable trarisfection. The vector includes the Herpes Simplex Thymidine Kinase promoter 
and the selectable neomycin gene. The nucleic acid encoding the g35030 protein or a portion 
thereof is obtained by PCR from a bacterial vector containing the a nucleotide sequence of an exon 
of a g35030 gene as described herein and in SEQ ID No 1, or from a g35030 cDNA comprising a 
nucleic acid of SEQ ID No 2 to 1 7 using oligonucleotide primers complementary to the g35030 
nucleic acid or portion thereof and containing restriction endonuclease sequences for Pst I 
incorporated into the 5' primer and Bglll at the 5' end of the corresponding cDNA 3' primer, taking 
care to ensure that the sequence encoding the g35030 protein or a portion thereof is positioned 
properly with respect to the poly A signal. The purified fragment obtained from the resulting PCR 

reaction is digested with Pstl, blunt ended with an exonuclease, digested with Bgl II, purified and 
ligated to pXTI ,.now containing a poly A signal and digested with Bglll. 

The ligated product is transfected into mouse NTH 3T3 cells using Lipofectin (Life 

Technologies, Inc., Grand Island, New York) under conditions outlined in the product 

specification. Positive transfectants are selected after growing the transfected cells in 600ug/ml 

G41 8 (Sigma, St. Louis, Missouri). 

Alternatively, the nucleic acids encoding the g35030 protein or a portion thereof is cloned 

into pED6dpc2 (Genetics Institute, Cambridge, MA). The resulting pED6dpc2 constructs is 

transfected into a suitable host cell, such as COS 1 cells. Methotrexate resistant cells are selected 

and expanded. 

The above procedures may also be used to express a mutant g35030 protein responsible 
for a detectable phenotype or a portion thereof. 

The expressed proteins are purified using conventional purification techniques such as 
ammonium sulfate precipitation or chromatographic separation based on size or charge. The 
protein encoded by the nucleic acid insert may also be purified using standard 
immunochromatography techniques. In such procedures, a solution containing the expressed 
g35030 protein or portion thereof, such as a cell extract, is applied to a column having antibodies 
against the g35030 protein or portion thereof is attached to the chromatography matrix. The 
expressed protein is allowed to bind the immunochromatography column. Thereafter, the column 
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is washed to remove non-specifically bound proteins. The specifically bound expressed protein is 
then released from the column and recovered using standard techniques. 

To confirm expression of the g35030 protein or a portion thereof, the proteins expressed 
from host cells containing an expression vector containing an insert encoding the g3 5030 protein or 
a portion thereof can be compared to the proteins expressed in host cells containing the expression 
vector without an insert. The presence of a band in samples from cells containing the expression 
vector with an insert which is absent in samples from cells containing the expression vector without 
an insert indicates that the g35030 protein or a portion thereof is being expressed. Generally, the 
band will have the mobility expected for the g35030 protein or portion thereof. However, the band 
may have a mobility different than that expected as a result of modifications such as glycosylation, 
ubiquitination, or enzymatic cleavage. 

Antibodies capable of specifically recognizing the expressed g35030 protein or a portion 
thereof are described below. 

If antibody production is not possible, the nucleic acids encoding the g35030 protein or a 
portion thereof is incorporated into expression vectors designed for use in purification schemes 
employing chimeric polypeptides. In such strategies the nucleic acid encoding the g3 5030 protein 
or a portion thereof is inserted in frame with the gene encoding the other half of the chimera. The 
other half of the chimera is p-globin or a nickel binding polypeptide encoding sequence. A 
chromatography matrix having antibody to p-globin or nickel attached thereto is then used to purify 
the chimeric protein. Protease cleavage sites is engineered between the p-globin gene or the nickel 
binding polypeptide and the g35030 protein or portion thereof. Thus, the two polypeptides of the 
chimera is separated from one another by protease digestion. 

One useful expression vector for generating P-globin chimeric proteins is pSG5 
(Stratagene), which encodes rabbit p-globin. Intron II of the rabbit P-globin gene facilitates 
splicing of the expressed transcript, and the polyadenylation signal incorporated into the construct 
increases the level of expression. These techniques are well known to those skilled in the art of 
molecular biology. Standard methods are published in methods texts such as Davis et al, (1986) 
and many of the methods are available from Stratagene, Life Technologies, Inc., or Promega. 
Polypeptide may additionally be produced from the construct using in vitro translation systems 
such as the In vitro Express™ Translation Kit (Stratagene). 

Antibodies That Bind g35030 Polypeptides of the Invention 

Any g35030 polypeptide or whole protein may be used to generate antibodies capable 
of specifically binding to an expressed g35030 protein or fragments thereof. 

For an antibody composition to specifically bind to a g35030 protein, it must 
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demonstrate at least a 5%, 10%, 15%, 20%, 25%, 50%, or 100% greater binding affinity for foil 
length g35030 protein than for any full length protein in an EL1SA, RIA, or other antibody- 
based binding assay. For an antibody composition to specifically bind to a variant g35030 
protein, it must demonstrate at least a 5%, 10%, 15%, 20%, 25%, 50%, or 100% greater binding 
affinity for the respective full length variant g35030 protein than for the respective reference 
g35030 full length protein in an ELISA, RIA, or other antibody-based binding assay. 

One antibody composition of the invention is capable of specifically binding or 
specifically binds to the respective g35030 proteins of SEQ ID Nos 1 8 to 23. Other antibody 
compositions of the invention are capable of specifically binding or specifically bind to a 
g35030 protein variant 

In one embodiment, the invention concerns antibody compositions, either polyclonal or 
monoclonal, capable of selectively binding, or selectively bind to an epitope-containing a 
polypeptide comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a g35030 
polypeptide. 

The invention also concerns a purified or isolated antibody capable of specifically 
binding to a mutated g35030 protein or to a fragment or variant thereof comprising an epitope 
of the mutated g35030 protein. In another preferred embodiment, the present invention 
concerns an antibody capable of binding to a polypeptide comprising at least 10 consecutive 
amino acids of a g35030 protein and including at least one of the amino acids which can be 
encoded by the trait causing mutations. 

An epitope can comprise as few as 3 amino acids in a spatial conformation, which is 
unique to the epitope. Generally an epitope consists of at least 6 such amino acids, and more 
often at least 8-10 such amino acids. In preferred embodiment, antigenic epitopes comprise a 
number of amino acids that is any integer between 3 and 50. Fragments which function as 
epitopes may be produced by any conventional means. Epitopes can be determined by a 
Jameson- Wolf antigenic analysis, for example, performed using the computer program 
PROTEAN, using default parameters (Version 4.0 Windows, DNASTAR, Inc., 1228 South Park 
Street Madison, WI. 

Predicted antigenic epitopes are shown below. It is pointed out that the immunogenic 
epitope list describe only amino acid residues comprising epitopes predicted to have the highest 
degree of immunogenicity by a particular algorithm. Polypeptides of the present invention that 
are not specifically described as immunogenic are not considered non-antigenic. This is 
because they may still be antigenic in vivo but merely not recognized as such by the particular 
algorithm used. Alternatively, the polypeptides are probably antigenic in vitro using methods 
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such a phage display. Thus, listed below are the amino acid residues comprising only preferred 
epitopes, not a complete list. In fact, all fragments of the polypeptides of the present invention, 
at least 6 amino acids residues in length, are included in the present invention as being useful as 
antigenic epitope. Moreover, listed below are only the critical residues of the epitopes 
determined by the Jameson- Wolf analysis. Thus, additional flanking residues on either the N- 
terminal, C-terminal, or both N- and C-terminal ends may be added to the sequences listed to 
generate an epitope-bearing portion at least 6 residues in length. Amino acid residues 
comprising other immunogenic epitopes may be determined by algorithms similar to the 
Jameson- Wolf analysis or by in vivo testing for an antigenic response using the methods 
described herein or those known in the art. 

The epitope-bearing fragments of the present invention preferably comprises 6 to 50 
amino acids (i.e. any integer between 6 and 50, inclusive) of a polypeptide of the present 
invention. Also, included in the present invention are antigenic fragments between the integers 
of 6 and the full length g35030 sequence. 

Preferred e35030 immunogenic epitopes : 

SEQ ID No. 18: Val-9 to Arg-1 1; Gly-21 to Lys-22 

SEQ ID No. 1 9: Val-9 to Arg-1 1 ; Pro-45 to Pro-52 

SEQ ID No. 20: Val-9 to Arg-1 1 

SEQ ID No. 21 : Val-9 to Arg-1 1 ; Ser-40 to Ser-42 

SEQ ID No. 22: Val-9 to Arg-1 1 

SEQ ID No. 23: Val-9 to Arg-1 1; Pro-45 to Glu-49; Ser-57 to Val-60 

Non-human animals or mammals, whether wild-type or transgenic, which express a 
different species of g35030 than the one to which antibody binding is desired, and animals 
which do not express g35030 (i.e. a g35030 knock out animal as described herein) are 
particularly useful for preparing antibodies. g35030 knock out animals will recognize all or 
most of the exposed regions of a g35030 protein as foreign antigens, and therefore produce 
antibodies with a wider array of g35030 epitopes. Moreover, smaller polypeptides with only 10 
to 30 amino acids may be useful in obtaining specific binding to any one of the g35030 proteins. 
In addition, the humoral immune system of animals which produce a species of g35030 that 
resembles the antigenic sequence will preferentially recognize the differences between the 
animal's native g35030 species and the antigen sequence, and produce antibodies to these 
unique sites in the antigen sequence. Such a technique will be particularly useful in obtaining 
antibodies that specifically bind to any one of the g35030 proteins. 

Antibody preparations prepared according to either protocol are useful in quantitative 



WO 01/40493 

PCT/IB00/01507 

53 

immunoassays which determine concentrations of antigen-bearing substances in biological 
samples; they are also used semi-quantitatively or qualitatively to identify the presence of 
antigen in a biological sample. The antibodies may also be used in therapeutic compositions for 
killing cells expressing the protein or reducing the levels of the protein in the body. 

The antibodies of the invention may be labeled by any one of the radioactive, 
fluorescent or enzymatic labels known in the art. 

Consequently, the invention is also directed to a method for detecting specifically the 
presence of a g35030 polypeptide according to the invention in a biological sample, said 
method comprising the following steps: 

a) bringing into contact the biological sample with a polyclonal or monoclonal antibody 
that specifically binds a g35030 polypeptide comprising an amino acid sequence of SEQ ID 
Nos. 18 to 23, or to a peptide fragment or variant thereof; and 

b) detecting the antigen-antibody complex formed. 

The invention also concerns a diagnostic kit for detecting in vitro the presence of a 
g35030 polypeptide according to the present invention in a biological sample, wherein said kit 
comprises: 

a) a polyclonal or monoclonal antibody that specifically binds a g35030 polypeptide 
comprising an amino acid sequence of SEQ ID Nos 18 to 23, or to a peptide fragment or variant 
thereof, optionally labeled; 

b) a reagent allowing the detection of the antigen-antibody complexes formed, said 
reagent carrying optionally a label, or being able to be recognized itself by a labeled reagent, 
more particularly in the case when the above-mentioned monoclonal or polyclonal antibody is 
not labeled by itself. 

The present invention thus relates to antibodies and T-cell antigen receptors (TCR), 
which specifically bind the polypeptides, and more specifically, the epitopes of the 
polyepeptides of the present invention, including but not limited to IgG (including JgG 1, IgG2, 
IgG3, and IgG4), IgA (including IgAl and IgA2), IgD, IgE, or IgM, and IgY. In a preferred 
embodiment the antibodies are human antigen binding antibody fragments of the present 
invention include, but are not limited to, Fab, Fab" F(ab)2 and F(ab')2, Fd, single-chain Fvs 
(scFv), single-chain antibodies, disulfide-linked Fvs (sdFv) and fragments comprising either a 
V L or V H domain. The antibodies may be from any animal origin including birds and mammals. 
Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may comprise 
the variable region(s) alone or in combination with the entire or partial of the following: hinge 
region, CHI, CH2, and CH3 domains. Also included in the invention are any combinations of 
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variable region(s) and hinge region, CHI, CH2, and CH3 domains. The present invention 
further includes chimeric, humanized, and human monoclonal and polyclonal antibodies, which 
specifically bind the polypeptides of the present invention. The present invention further 
includes antibodies that are anti-idiotypic to the antibodies of the present invention. 

The antibodies of the present invention may be monospecific, bispecific, trispecific or 
have greater multispecificity. Multispecific antibodies may be specific for different epitopes of 
a polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for heterologous compositions, such as a heterologous polypeptide or solid 
support material. See, e.g., WO 93/17715; WO 92/08802; WO 91/00360; WO 92/05793; Tutt, 
A. et al. (1991) J. Immunol. 147:60-69; US Patents 5,573,920, 4,474,893, 5,601,819, 4,714,681, 
4,925,648; Kostelny, S.A. et al. (1992) J. Immunol. 148:1547-1553. 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or epitope-bearing portion(s) of a polypeptide of the present invention, which are 
recognized or specifically bound by the antibody. In the case of proteins of the present 
invention secreted proteins, the antibodies may specifically bind a full-length protein encoded 
by a nucleic acid of the present invention, a mature protein (i.e., the protein generated by 
cleavage of the signal peptide) encoded by a nucleic acid of the present invention, a signal 
peptide encoded by a nucleic acid of the present invention, or any other polypeptide of the 
present invention. Therefore, the epitope(s) or epitope bearing polypeptide portion(s) may be 
specified as described herein, e.g., by N-terminal and C-terminal positions, by size in contiguous 
amino acid residues, or otherwise described herein (including the squence listing). Antibodies 
which specifically bind any epitope or polypeptide of the present invention may also be 
excluded as individual species. Therefore, the present invention includes antibodies that 
specifically bind specified polypeptides of the present invention, and allows for the exclusion of 
the same. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homolog 
of the polypeptides of the present invention are included. Antibodies that do not bind 
polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, 
less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% identity (as 
calculated using methods known in the art and described herein) to a polypeptide of the present 
invention are also included in the present invention. Further included in the present invention 
are antibodies, which only bind polypeptides encoded by polynucleotides, which hybridize to a 
polynucleotide of the present invention under stringent hybridization conditions (as described 
herein). Antibodies of the present invention may also be described or specified in terms of their 
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binding affinity. Preferred binding affinities include those with a dissociation constant or Kd 
less than 5X10" 6 M, lO^M, 5X10" 7 M, lO^M, 5X10 ^, 10" 8 M, 5X10" 9 M, lO 'M, 5X10 ,0 M, 10" 
l0 M, 5X10"M, 10"M, 5X10- |2 M, 10- ,2 M, 5X10" 13 M, 10" I3 M, 5X10"^, 10 M M, 5X10 ,5 M, and 
10" I5 M. 

Antibodies of the present invention have uses that include, but are not limited to, 
methods known in the art to purify, detect, and target the polypeptides of the present invention 
including both in vitro and in vivo diagnostic and therapeutic methods. For example, the 
antibodies have use in immunoassays for qualitatively and quantitatively measuring levels of the 
polypeptides of the present invention in biological samples. See, e.g., Harlow et al., 
ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 
1 988) (incorporated by reference in the entirety). 

The antibodies of the present invention may be used either alone or in combination with 
other compositions. The antibodies may further be recombinantly fused to a heterologous 
polypeptide at the N- or C-terminus or chemically conjugated (including covalent and non- 
covalent conjugations) to polypeptides or other compositions. For example, antibodies of the 
present invention may be recombinantly fused or conjugated to molecules useful as labels in 
detection assays and effector molecules such as heterologous polypeptides, drugs, or toxins. See, 
e.g., WO 92/08495; WO 91/14438; WO 89/12624; US Patent 5,314,995; and EP 0 396 387. 

The antibodies of the present invention may be prepared by any suitable method known 
in the art. For example, a polypeptide of the present invention or an antigenic fragment thereof 
can be administered to an animal in order to induce the production of sera containing polyclonal 
antibodies. The term "monoclonal antibody" is not limited to antibodies produced through 
hybridoma technology. The term "antibody" refers to a polypeptide or group of polypeptides 
which are comprised of at least one binding domain, where a binding domain is formed from 
the folding of variable domains of an antibody molecule to form three-dimensional binding 
spaces with an internal surface shape and charge distribution complementary to the features of 
an antigenic determinant of an antigen., which allows an immunological reaction with the 
antigen. The term "monoclonal antibody" refers to an antibody that is derived from a single 
clone, including eukaryotic, prokaryotic, or phage clone, and not the method by which it is 
produced. Monoclonal antibodies can be prepared using a wide variety of techniques known in 
the art including the use of hybridoma, recombinant, and phage display technology. 

Hybridoma techniques include those known in the art (See, e.g., Harlow et al. (1998); 
Hammerling, et al. (1981) (said references incorporated by reference in their entireties). Fab 
and F(ab')2 fragments may be produced, for example, from hybridoma-produced antibodies by 
proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to 
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produce F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the 
application of recombinant DNA technology or through synthetic chemistry using methods 
known in the art. For example, the antibodies of the present invention can be prepared using 
various phage display methods known in the art. In phage display methods, functional antibody 
domains are displayed on the surface of a phage particle, which carries polynucleotide 
sequences encoding them. Phage with a desired binding property are selected from a repertoire 
or combinatorial antibody library (e.g. human or murine) by selecting directly with antigen, 
typically antigen bound or captured to a solid surface or bead. Phage used in these methods are 
typically filamentous phage including fd and Ml 3 with Fab, Fv or disulfide stabilized Fv 
antibody domains recombinantly fused to either the phage gene III or gene VIII protein. 
Examples of phage display methods that can be used to make the antibodies of the present 
invention include those disclosed in Brinkman U. et al. (1995); Ames, R.S. et al. (1995); 
Kettleborough, C.A. et al. (1994); Persic, L. et al. (1997); Burton, D.R. et al. (1994); 
PCT/GB91/01134; WO 90/02809; WO9I/10737; WO 92/01047; WO 92/18619; WO 93/1 1236; 
WO 95/15982; WO 95/20401; and US Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 
5,427,908, 5,750,753, 5,821,047, 5,571,698, 5,427,908, 5,516,637, 5,780,225, 5,658,727 and 
5,733,743 (said references incorporated by reference in their entireties). 

As described in the above references, after phage selection, the antibody coding regions 
from the phage can be isolated and used to generate whole antibodies, including human 
antibodies, or any other desired antigen binding fragment, and expressed in any desired host 
including mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, 
techniques to recombinantly produce Fab, Fab' F(ab)2 and F(ab')2 fragments can also be 
employed using methods known in the art such as those disclosed in WO 92/22324; Mullinax, 
R.L. et al. (1992); and Sawai, H. et al. (1995); and Better, M. et al. (1988) (said references 
incorporated by reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al. (1991); Shu, L. et 
al. (1993); and Skerra, A. et al. (1988). For some uses, including in vivo use of antibodies in 
humans and in vitro detection assays, it may be preferable to use chimeric, humanized, or 
human antibodies. Methods for producing chimeric antibodies are known in the art. See e.g., 
Morrison (1985); Oi et al., (1986); Gillies, S.D. et al. (1989); and US Patent 5,807,715. 
Antibodies can be humanized using a variety of techniques including CDR-grafting (EP 0 239 
400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or resurfacing (EP 0 592 
106; EP 0 5 19 596; Padlan E.A., (1991); Studnicka G.M. et al. (1994); Roguska M.A. et al. 
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(1994), and chain shuffling (US Patent 5,565,332). Human antibodies can be made by a variety 
of methods known in the art including phage display methods described above. See also, US 
Patents 4,444,887, 4,716,1 1 1, 5,545,806, and 5,814,318; WO 98/46645; WO 98/50433; WO 
98/24893; WO 96/34096; WO 96/33735; and WO 91/10741 (said references incorporated by 
reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or 
chemically conjugated (including both covalently and non-covalently conjugations) to a 
polypeptide of the present invention. The antibodies may be specific for antigens other than 
polypeptides of the present invention. For example, antibodies may be used to target the 
polypeptides of the present invention to particular cell types, either in vitro or in vivo, by fusing 
or conjugating the polypeptides of the present invention to antibodies specific for particular cell 
surface receptors. Antibodies fused or conjugated to the polypeptides of the present invention 
may also be used in in vitro immunoassays and purification methods using methods known in 
the art. See e.g., Harbor et al. supra and WO 93/21232; EP 0 439 095; Naramura, M. et al. 
(1994); US Patent 5,474,981; Gillies, S.O. etal. (1992); Fell, H.P. et al. (1991) (said references 
incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of the 
present invention fused or conjugated to antibody domains other than the variable regions. For 
example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 
region, or portion thereof. The antibody portion fused to a polypeptide of the present invention 
may comprise the hinge region, CHI domain, CH2 domain, and CH3 domain or any 
combination of whole domains or portions thereof. The polypeptides of the present invention 
may be fused or conjugated to the above antibody portions to increase the in vivo half-life of the 
polypeptides or for use in immunoassays using methods known in the art. The polypeptides 
may also be fused or conjugated to the above antibody portions to form multimers. For 
example, Fc portions fused to the polypeptides of the present invention can form dimers through 
disulfide bonding between the Fc portions. Higher multimeric forms can be made by fusing the 
polypeptides to portions of IgA and IgM. Methods for fusing or conjugating the polypeptides of 
the present invention to antibody portions are known in the art. See e.g., US Patents 5,336,603, 
5,622,929, 5,359,046, 5,349,053, 5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 
96/04388, WO 91/06570; Ashkenazi, A. et al. (1 991); Zheng, X.X. et al. (1995); and Vil, H. et 
al. (1992) (said references incorporated by reference in their entireties). 

The invention further relates to antibodies that act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies 
that disrupt the receptor/ligand interactions with the polypeptides of the invention either 
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partially or fully. Included are both receptor-specific antibodies and ligand-specific antibodies. 
Included are receptor-specific antibodies, which do not prevent ligand binding but prevent 
receptor activation. Receptor activation (i.e., signaling) may be determined by techniques 
described herein or otherwise known in the art. Also include are receptor-specific antibodies 
which both prevent ligand binding and receptor activation. Likewise, included are neutralizing 
antibodies that bind the ligand and prevent binding of the ligand to the receptor, as well as 
antibodies that bind the ligand, thereby preventing receptor activation, but do not prevent the 
ligand from binding the receptor. Further included are antibodies that activate the receptor. 
These antibodies may act as agonists for either all or less than all of the biological activities 
affected by ligand-mediated receptor activation. The antibodies may be specified as agonists or 
antagonists for biological activities comprising specific activities disclosed herein. The above 
antibody agonists can be made using methods known in the art. See e.g., WO 96/40281; US 
Patent 5,81 1,097; Deng, B. et al. (1998); Chen, Z. et al. (1998); Harrop, J.A. et al. (1998); Zhu, 
Z. et al. (1998); Yoon, D.Y. et al. (1998); Prat, M. et al. (1998); Pitard, V. et al. (1997); 
Liautard, J. et al. (1997); Carlson, N.G. et al. (1997); Taryman, R.E. et al. (1995); Muller, Y.A. 
et al. (1998); Bartunek, P. et al. (1996) (said references incorporated by reference in their 
entireties). 

As discussed above, antibodies of the polypeptides of the invention can, in turn, be 
utilized to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using 
techniques well known to those skilled in the art. See, e.g. Greenspan and Bona, (1989); 
Nissinoff, (1991). For example, antibodies which bind to and competitively inhibit polypeptide 
multimerization or binding of a polypeptide of the invention to ligand can be used to generate 
anti-id ioty pes that "mimic" the polypeptide multimerization or binding domain and, as a 
consequence, bind to and neutralize polypeptide or its ligand. Such neutralization anti-idiotypic 
antibodies can be used to bind a polypeptide of the invention or to bind its ligands/receptors, 
and therby block its biological activity. 

Biallelic markers of the inventions 

Advantages of the biallelic markers of the present invention 

The biallelic marker of the inventions of the present invention offer a number of 
important advantages over other genetic markers such as RFLP (Restriction fragment length 
polymorphism) and VNTR (Variable Number of Tandem Repeats) markers. 

The first generation of markers, were RFLPs, which are variations that modify the 
length of a restriction fragment. But methods used to identify and to type RFLPs are relatively 
wasteful of materials, effort, and time. The second generation of genetic markers were VNTRs, 
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which can be categorized as either minisatellites or microsatellites. Minisatellites are tandemly 
repeated DNA sequences present in units of 5-50 repeats which are distributed along regions of 
the human chromosomes ranging from 0.1 to 20 kilobases in length. Since they present many 
possible alleles, their informative content is very high. Minisatellites are scored by performing 
Southern blots to identify the number of tandem repeats present in a nucleic acid sample from 

4 

the individual being tested. However, there are only 10 potential VNTRs that can be typed by 
Southern blotting. Moreover, both RFLP and VNTR markers are costly and time-consuming to 
develop and assay in large numbers. 

Single nucleotide polymorphism or biallelic markers can be used in the same manner as 
RFLPs and VNTRs but offer several advantages. Single nucleotide polymorphisms are densely 
spaced in the human genome and represent the most frequent type of variation. An estimated 

number of more than 10 sites are scattered along the 3x1 0 9 base pairs of the human genome. 
Therefore, single nucleotide polymorphism occur at a greater frequency and with greater 
uniformity than RFLP or VNTR markers which means that there is a greater probability that 
such a marker will be found in close proximity to a genetic locus of interest. Single nucleotide 
polymorphisms are less variable than VNTR markers but are mutationally more stable. 

Also, the different forms of a characterized single nucleotide polymorphism, such as the 
biallelic markers of the present invention, are often easier to distinguish and can therefore be 
typed easily on a routine basis. Biallelic markers have single nucleotide based alleles and they 
have only two common alleles, which allows highly parallel detection and automated scoring. 
The biallelic markers of the present invention offer the possibility of rapid, high-throughput 
genotyping of a large number of individuals. 

Biallelic markers are densely spaced in the genome, sufficiently informative and can be 
assayed in large numbers. The combined effects of these advantages make biallelic markers 
extremely valuable in genetic studies. Biallelic markers can be used in linkage studies in 
families, in allele sharing methods, in linkage disequilibrium studies in populations, in 
association studies of case-control populations. An important aspect of the present invention is 
that biallelic markers allow association studies to be performed to identify genes involved in 
complex traits. Association studies examine the frequency of marker alleles in unrelated case- 
and control-populations and are generally employed in the detection of polygenic or sporadic 
traits. Association studies may be conducted within the general population and are not limited 
to studies performed on related individuals in affected families (linkage studies). Biallelic 
markers in different genes can be screened in parallel for direct association with disease or 
response to a treatment. This multiple gene approach is a powerful tool for a variety of human 
genetic studies as it provides the necessary statistical power to examine the synergistic effect of 
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multiple genetic factors on a particular phenotype, drug response, sporadic trait, or disease state 
with a complex genetic etiology. 



Biallelic Markers And Polynucleotides Comprising Them 
Polynucleotides of the present invention 

In one aspect, the invention concerns biallelic markers associated with schizophrenia. 
The invention comprises chromosome 13q3 l-q33-related biallelic markers and g35030-related 
biallelic markers. The markers are generally referred to herein as Al, A2, A3 and so on. The 
biallelic markers of the invention comprise the biallelic markers designated Al to A127 in 
Table 5. Also included are biallelic markers in linkage disequilibrium with the biallelic markers 
of the invention. 

The polynucleotide of the invention may consist of, consist essentially of, or comprise a 
contiguous span of nucleotides of a sequence from any of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 
and 79 to 132 as well as sequences which are complementary thereto ("complements thereof). 
The "contiguous span" may be at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 
500, 1 000 or 2000 nucleotides in length, to the extent that a contiguous span of these lengths is 
consistent with the lengths of the particular Sequence ID. 

The present invention encompasses polynucleotides for use as primers and probes in the 
methods of the invention. These polynucleotides may consist of, consist essentially of, or 
comprise a contiguous span of nucleotides of a sequence from any of SEQ ID Nos. I to 17, 24, 
25, 28, 74 and 79 to 132 as well as sequences which are complementary thereto ("complements 
thereof). The "contiguous span" may be at least 8, 10, 12, 1 5, 1 8, 20, 25, 35, 40, 50, 70, 80, 
100, 250, 500 , 1000 or 2000 nucleotides in length, to the extent that a contiguous span of these 
lengths is consistent with the lengths of the particular Sequence ID. It should be noted that the 
polynucleotides of the present invention are not limited to having the exact flanking sequences 
surrounding the polymorphic bases which, are enumerated in the Sequence Listing. Rather, it 
will be appreciated that the flanking sequences surrounding the biallelic markers and other 
polymorphisms of the invention, or any of the primers of probes of the invention which, are 
more distant from the markers, may be lengthened or shortened to any extent compatible with 
their intended use and the present invention specifically contemplates such sequences. It will be 
appreciated that the polynucleotides of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 may 
be of any length compatible with their intended use. Also the flanking regions outside of the 
contiguous span need not be homologous to native flanking sequences which actually occur in 
human subjects. The addition of any nucleotide sequence, which is compatible with the 
nucleotides intended use is specifically contemplated. The contiguous span may optionally 
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include the biallelic markers of the invention in said sequence. Biallelic markers generally 
comprise a polymorphism at one single base position. Each biallelic marker therefore 
corresponds to two forms of a polynucleotide sequence which, when compared with one 
another, present a nucleotide modification at one position. Usually, the nucleotide modification 
involves the substitution of one nucleotide for another. Optionally allele 1 or allele 2 of the 
biallelic markers disclosed in Table 5 may be specified as being present at the biallelic marker 
of the invention. Preferred polynucleotides may consist of, consist essentially of, or comprise a 
contiguous span of nucleotides of a sequence from SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 
to 132 as well as sequences which are complementary thereto. The "contiguous span" may be 
at least 8, 10, 12, 15, 18,20,25,35,40,50, 70, 80, 100,250, 500, 1 000 or 2000 nucleotides in 
length, to the extent that a contiguous span of these lengths is consistent with the lengths of the 
particular Sequence ID. 

A preferred probe or primer comprises a nucleic acid comprising a polynucleotide 
selected from the group of the nucleotide sequences of PI to PI 27 and the complementary 
sequence thereto, Bl to Bl 14, CI to CI 14, Dl to DI27, El to E127. 

The invention also relates to polynucleotides that hybridize, under conditions of high or 
intermediate stringency, to a polynucleotide of any of SEQ ID Nos. I to 17, 24, 25, 28, 74 and 
79 to 132 as well as sequences, which are complementary thereto. Preferably such 
polynucleotides are at least 20, 25, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000 nucleotides 
in length, to the extent that a polynucleotide of these lengths is consistent with the lengths of the 
particular Sequence ID. Preferred polynucleotides comprise a polymorphism of the invention. 
Optionally either allele 1 or allele 2 of the polymorphism disclosed in Table 5 may be specified 
as being present at the polymorphism of the invention. Particularly preferred polynucleotides 
comprise a biallelic marker of the invention. Conditions of high stringency are further 
described herein. 

The primers of the present invention may be designed from the disclosed sequences for 
any method known in the art. A preferred set of primers is fashioned such that the 3' end of the 
contiguous span of identity with the sequences of any of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 
and 79 to 1 32 is present at the 3' end of the primer. Such a configuration allows the 3' end of 
the primer to hybridize to a selected nucleic acid sequence and dramatically increases the 
efficiency of the primer for amplification or sequencing reactions. In a preferred set of primers 
the contiguous span is found in one of the sequences described in Table 4. Allele specific 
primers may be designed such that a biallelic marker or other polymorphism of the invention is 
at the 3* end of the contiguous span and the contiguous span is present at the 3' end of the 
primer. Such allele specific primers tend to selectively prime an amplification or sequencing 
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reaction so long as they are used with a nucleic acid sample that contains one of the two alleles 
present at said marker. The 3' end of primer of the invention may be located within oral least 
2, 4, 6, 8, 10, 12, 15, 18, 20, 25, 50, 100, 250, 500, or 1000 nucleotides upstream of a biallelic 
marker of the invention in said sequence or at any other location which is appropriate for their 
intended use in sequencing, amplification or the location of novel sequences or markers. 
Primers with their 3 1 ends located 1 nucleotide upstream of an biallelic marker of the invention 
have a special utility as microsequencing assays. Preferred microsequencing primers are 
described in Table 6. 

The probes of the present invention may be designed from the disclosed sequences for 
any method known in the art, particularly methods which allow for testing if a particular 
sequence or marker disclosed herein is present. A preferred set of probes may be designed for 
use in the hybridization assays of the invention in any manner known in the art such that they 
selectively bind to one allele of a biallelic marker or other polymorphism, but not the other 
under any particular set of assay conditions. Preferred hybridization probes may consists of, 
consist essentially of, or comprise a contiguous span which ranges in length from 8, 10, 12, 1 5, 
18 or 20 to 25, 35, 40, 50, 60, 70, or 80 nucleotides, or be specified as being 12, 15, 18,20,25,' 
35, 40, or 50 nucleotides in length and including an biallelic marker or other polymorphism of 
the invention in said sequence. In a preferred embodiment, either of allele 1 or 2 disclosed in 
Table 5 may be specified as being present at the biallelic marker site. In another preferred 
embodiment, said biallelic marker may be within 6, 5, 4, 3, 2, or 1 nucleotides of the center of 
the hybridization probe or at the center of said probe. 

In one embodiment the invention encompasses isolated, purified, and recombinant 
polynucleotides comprising, consisting of, or consisting essentially of a contiguous span of 8 to 
50 nucleotides of any one of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32 and the 
complement thereof, wherein said span includes a biallelic marker of the invention, a 
chromosome 13q3 l-q33-related biallelic marker or g35030-related biallelic marker in said 
sequence; optionally, wherein said polymorphism or biallelic marker selected from the group 
consisting of A I to A 127, and the complements thereof, or optionally the biallelic markers in 
linkage disequilibrium therewith; optionally, wherein said biallelic marker is selected from the 
group consisting of A13 to A 18, A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, 
A76 and A 123 ; optionally, wherein said biallelic marker is selected from the group consisting 
of A13 to A65, and the complements thereof, or optionally the biallelic markers in linkage 
disequilibrium therewith; optionally, wherein said contiguous span is 18 to 35 nucleotides in 
length and said biallelic marker is within 4 nucleotides of the center of said polynucleotide; 
optionally, wherein said polynucleotide consists of said contiguous span and said contiguous 



WO 01/40493 PCT/IB00/01507 

63 

span is 25 nucleotides in length and said biallelic marker is at the center of said polynucleotide; 
optionally, wherein the 3* end of said contiguous span is present at the 3* end of said 
polynucleotide; and optionally, wherein the 3* end of said contiguous span is located at the 3* 
end of said polynucleotide and said biallelic marker is present at the 3' end of said 
polynucleotide. In a preferred embodiment, said probes comprise, consists of, or consists 
essentially of a sequence selected from the following sequences: PI to P127 and the 
complementary sequences thereto. 

In another embodiment the invention encompasses isolated, purified and recombinant 
polynucleotides comprising, consisting of, or consisting essentially of a contiguous span of 8 to 
50 nucleotides of any one of SEQ ID Nos 1 to 1 7, 24, 25,28, 74 and 79 to 1 32, or the 
complement thereof, wherein the 3' end of said contiguous span is located at the 3' end of said 
polynucleotide, and wherein the 3' end of said polynucleotide is located within 20 nucleotides 
upstream of a polymorphism of the invention, chromosome 13q31-q33-related biallelic marker, 
or g35030-related biallelic marker in said sequence; optionally, wherein said polymorphism or 
biallelic marker is selected from the group consisting of Al to A 127, and the complements 
thereof, or optionally the biallelic markers in linkage disequilibrium therewith; optionally, 
wherein said biallelic marker is selected from the group consisting of A 13 to A 18, A20 to A46, 
A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A 123 , and the complements thereof, 
or optionally the biallelic markers in linkage disequilibrium therewith; optionally, wherein said 
biallelic marker is selected from the group consisting of A 13 to A65; optionally, wherein the 3' 
end of said polynucleotide is located 1 nucleotide upstream of said polymorphism or biallelic 
marker; and optionally, wherein said polynucleotide comprises, consists of, or consists 
essentially of a sequence selected from the following sequences: Dl to D127 and El to E127. 

In a further embodiment, the invention encompasses isolated, purified, or recombinant 
polynucleotides comprising, consisting of, or consisting essentially of a sequence selected from 
the following sequences: B 1 to B 1 1 4 and C 1 to C 1 1 4. 

In an additional embodiment, the invention encompasses polynucleotides for use in 
hybridization assays, sequencing assays, and enzyme-based mismatch detection assays for 
determining the identity of the nucleotide at a polymorphism or biallelic marker in any of SEQ 
ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the complement thereof, as well as 
polynucleotides for use in amplifying segments of nucleotides comprising a polymophism or 
biallelic marker of the invention in any of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32 or 
the complement thereof; optionally, wherein said polymorphism or biallelic marker is selected 
from the group consisting of Al to A 127, and the complements thereof, or optionally the 
biallelic markers in linkage disequilibrium therewith; optionally, wherein said biallelic marker 
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is selected from the group consisting of A 13 to A 18, A20 to A46, A49 to A52, A55, A57, A59 
to A63, A72 to A73, A76 and A 1 23 , and the complements thereof, or optionally the biallelic 
markers in linkage disequilibrium therewith; optionally, wherein said biallelic marker is 
selected from the group consisting of A 13 to A65, and the complements thereof, or optionally 
the biallelic markers in linkage disequilibrium therewith. 

These arrays may generally be produced using mechanical synthesis methods or light 
directed synthesis methods, which incorporate a combination of photolithographic methods and 
solid phase oligonucleotide synthesis (Fodor et al., Science, 251:767-777, 1991). The 
immobilization of arrays of oligonucleotides on solid supports has been rendered possible by 
the development of a technology generally identified as "Very Large Scale Immobilized 
Polymer Synthesis" (VLSIPS™) in which, typically, probes are immobilized in a high density 
array on a solid surface of a chip. Examples of VLSIPS™ technologies are provided in US 
Patents 5,143,854 and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and 
WO 95/1 1995, which describe methods for forming oligonucleotide arrays through techniques 
such as light-directed synthesis technique. In designing strategies aimed at providing arrays of 
nucleotides immobilized on solid supports, further presentation strategies were developed to 
order and display the oligonucleotide arrays on the chips in an attempt to maximize 
hybridization patterns and sequence information. Examples of such presentation strategies are 
disclosed in PCT Publications WO 94/12305, WO 94/1 1530, WO 97/29212 and WO 97/31256. 

Oligonucleotide arrays may comprise at least one of the sequences selected from the 
group consisting of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132; and the sequences 
complementary thereto or a fragment thereof of at least 8, 10, 12, 15, 1 8, 20, 25, 35, 40, 50, 70, 
80, 100, 250, 500 , 1000 or 2000 consecutive nucleotides, to the extent that fragments of these 
lengths is consistent with the lengths of the particular Sequence ID, for determining whether a 
sample contains one or more alleles of the biallelic markers of the present invention. 
Oligonucleotide arrays may also comprise at least one of the sequences selected from the group 
consisting of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132; and the sequences 
complementary thereto or a fragment thereof of at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 
80, 100, 250, 500 , 1000 or 2000 consecutive nucleotides, to the extent that fragments of these 
lengths is consistent with the lengths of the particular Sequence ID, for amplifying one or more 
alleles of the biallelic markers of Table 5. In other embodiments, arrays may also comprise at 
least one of the sequences selected from the group consisting of SEQ ID Nos. 1 to 17, 24, 25, 
28, 74 and 79 to 132; and the sequences complementary thereto or a fragment thereof of at least 
8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000 consecutive 
nucleotides, to the extent that fragments of these lengths is consistent with the lengths of the 
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particular Sequence ID, for conducting microsequencing analyses to determine whether a 
sample contains one or more alleles of the biallelic markers of the invention. In still further 
embodiments, the oligonucleotide array may comprise at least one of the sequences selecting 
from the group consisting of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132; and the 
sequences complementary thereto or a fragment thereof of at least 8, 10, 12, 15, 18,20,25 35 
40, 50, 70, 80, 1 00, 250, 500 , 1 000 or 2000 nucleotides in length, to the extent that fragments' 
of these lengths is consistent with the lengths of the particular Sequence ID, for determining 
whether a sample contains one or more alleles of the polymorphisms and biallelic markers of 
the present invention. 

A further object of the invention relates to an array of nucleic acid sequences 
comprising either at least one of the sequences selected from the group consisting of PI to 
PI27, BI to Bl 14, CI to CI 14, Dl to D127 El to E127 or the sequences complementary thereto 
or a fragment thereof of at least 8, 10, 12, 15, 18, 20, 25, 30, or 40 consecutive nucleotides 
thereof, or at least one sequence comprising at least 1, 2, 3, 4, 5, 10, 20 biallelic markers 
selected from the group consisting of Al to A127 or the complements thereof. The invention 
also pertains to an array of nucleic acid sequences comprising either at least 1 , 2, 3, 4, 5, 10, 20 
of the sequences selected from the group consisting of PI toP127,Bl toB114,Cl toCll^Dl 
to D127, El to E127 or the sequences complementary thereto or a fragment thereof of at least 8 
consecutive nucleotides thereof, or at least two sequences comprising a biallelic marker selected 
from the group consisting of A 1 to A 127 or the complements thereto. In preferred 
embodiments, said biallelic markers are selected from the group consisting of A13 to A18, A20 
to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 . In preferred 
embodiments, said probes, amplification primers and microsequencing primers correspond in 
Table 6 to said biallelic markers. 

The present invention also encompasses diagnostic kits comprising one or more 
polynucleotides of the invention, optionally with a portion or all of the necessary reagents and 
instructions for genotyping a test subject by determining the identity of a nucleotide at an 
biallelic marker of the invention. The polynucleotides of a kit may optionally be attached to a 
solid support, or be part of an array or addressable array of polynucleotides. The kit may 
provide for the determination of the identity of the nucleotide at a marker position by any 
method known in the art including, but not limited to, a sequencing assay method, a 
microsequencing assay method, a hybridization assay method, or enzyme-based mismatch 
detection assay. Optionally such a kit may include instructions for scoring the results of the 
determination with respect to the test subjects' predisposition to schizophrenia, or likely 
response to an agent acting on schizophrenia, or chances of suffering from side effects to an 
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agent acting on schizophrenia. 

Optionally, in any embodiments described herein, a biallelic marker of the invention 
may comprise: 

(a) a biallelic marker selected from the group consisting of g35030-related markers A13 
to A65; (b) a biallelic marker selected from the group consisting of chromosome 13q31-q33- 
related-biallelic markers A 1 3 to A 1 8, A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to 
A73, A76 and A 123; or (c) a biallelic marker selected from the group consisting of chromosome 
13q31-q33-related-biallelic markers A13 to A18, A20 to A47, A49 to A52, A55, A57, A59 to 
A63. Furthermore, any of the embodiments of the invention may specifically exclude one or 
more of the biallelic markers selected from biallelic markers A 1 to A 1 27. 

Methods For De Novo Identification Of Biallelic Markers 

Any of a variety of methods can be used to screen a genomic fragment for single 
nucleotide polymorphisms such as differential hybridization with oligonucleotide probes, 
detection of changes in the mobility measured by gel electrophoresis or direct sequencing of the 
amplified nucleic acid. A preferred method for identifying biallelic markers involves 
comparative sequencing of genomic DNA fragments from an appropriate number of unrelated 
individuals. 

In a first embodiment, DNA samples from unrelated individuals are pooled together, 
following which the genomic DNA of interest is amplified and sequenced. The nucleotide 
sequences thus obtained are then analyzed to identify significant polymorphisms. One of the 
major advantages of this method resides in the fact that the pooling of the DNA samples 
substantially reduces the number of DNA amplification reactions and sequencing reactions, 
which must be carried out. Moreover, this method is sufficiently sensitive so that a biallelic 
marker obtained thereby usually demonstrates a sufficient frequency of its less common allele 
to be usefiil in conducting association studies. Usually, the frequency of the least common 
allele of a biallelic marker identified by this method is at least 10%. 

In a second embodiment, the DNA samples are not pooled and are therefore amplified 
and sequenced individually. This method is usually preferred when biallelic markers need to be 
identified in order to perform association studies within candidate genes. Preferably, highly 
relevant gene regions such as promoter regions or exon regions may be screened for biallelic 
markers. A biallelic marker obtained using this method may show a lower degree of 
informativeness for conducting association studies, e.g. if the frequency of its less frequent 
allele may be less than about 1 0%. Such a biallelic marker will however be sufficiently 
informative to conduct association studies and it will further be appreciated that including less 
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informative biallelic markers in the genetic analysis studies of the present invention, may allow 
in some cases the direct identification of causal mutations, which may, depending on their 
penetrance, be rare mutations. 

The following is a description of the various parameters of a preferred method used by 
the inventors for the identification of the biallelic markers of the present invention. 

Genomic DNA samples 

The genomic DNA samples from which the biallelic markers of the present invention 
are generated are preferably obtained from unrelated individuals corresponding to a 
heterogeneous population of known ethnic background. The number of individuals from whom 
DNA samples are obtained can vary substantially, preferably from about 10 to about 1000, more 
preferably from about 50 to about 200 individuals. Usually, DNA samples are collected from at 
least about 100 individuals in order to have sufficient polymorphic diversity in a given 
population to identify as many markers as possible and to generate statistically significant 
results. 

As for the source of the genomic DNA to be subjected to analysis, any test sample can 
be foreseen without any particular limitation. These test samples include biological samples, 
which can be tested by the methods of the present invention described herein, and include 
human and animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, 
lymph fluids, and various external secretions of the respiratory, intestinal and genitourinary 
tracts, tears, saliva, milk, white blood cells, myelomas and the like; biological fluids such as cell 
culture supematants; fixed tissue specimens including tumor and non-tumor tissue and lymph 
node tissues; bone marrow aspirates and fixed cell specimens. The preferred source of genomic 
DNA used in the present invention is from peripheral venous blood of each donor. Techniques 
to prepare genomic DNA from biological samples are well known to the skilled technician. 
Details of a preferred embodiment are provided in Example 1. The person skilled in the art can 
choose to amplify pooled or unpooled DNA samples. 

DNA Amplification 

The identification of biallelic markers in a sample of genomic DNA may be facilitated 
through the use of DNA amplification methods. DNA samples can be pooled or unpooled for 
the amplification step. DNA amplification techniques are well known to those skilled in the art. 
Various methods to amplify DNA fragments carrying biallelic markers are further described 
hereinafter herein. The PCR technology is the preferred amplification technique used to 
identify new biallelic markers. 

In a first embodiment, biallelic markers are identified using genomic sequence 
information generated by the inventors. Genomic DNA fragments, such as the inserts of the 
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BAC clones described above, are sequenced and used to design primers for the amplification of 
500 bp fragments. These 500 bp fragments are amplified from genomic DNA and are scanned 
for biallelic markers. Primers may be designed using the OSP software (Hillier L. and Green 
P., 1991). All primers may contain, upstream of the specific target bases, a common 
oligonucleotide tail that serves as a sequencing primer. Those skilled in the art are familiar with 
primer extensions, which can be used for these purposes. 

In another embodiment of the invention, genomic sequences of candidate genes are 
available in public databases allowing direct screening for biallelic markers. Preferred primers, 
useful for the amplification of genomic sequences encoding the candidate genes, focus on 
promoters, exons and splice sites of the genes. A biallelic marker present in these functional 
regions of the gene have a higher probability to be a causal mutation. 

Sequencing Of Amplified Genomic DNA And Identification Of Single Nucleotide 
Polymorphisms 

The amplification products generated as described above, are then sequenced using any 
method known and available to the skilled technician. Methods for sequencing DNA using 
either the dideoxy-mediated method (Sanger method) or the Maxam-Gilbert method are widely 
known to those of ordinary skill in the art. Such methods are for example disclosed in Maniatis 
et al. (Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Second Edition, 
1989). Alternative approaches include hybridization to high-density DNA probe arrays as 
described inCheeetal. (Science 274, 610, 1996). 

Preferably, the amplified DNA is subjected to automated dideoxy terminator 
sequencing reactions using a dye-primer cycle sequencing protocol. The products of the 
sequencing reactions are run on sequencing gels and the sequences are determined using gel 
image analysis. The polymorphism search is based on the presence of superimposed peaks in 
the electrophoresis pattern resulting from different bases occurring at the same position. 
Because each dideoxy terminator is labeled with a different fluorescent molecule, the two peaks 
corresponding to a biallelic site present distinct colors corresponding to two different 
nucleotides at the same position on the sequence. However, the presence of two peaks can be 
an artifact due to background noise. To exclude such an artifact, the two DNA strands are 
sequenced and a comparison between the peaks is carried out. In order to be registered as a 
polymorphic sequence, the polymorphism has to be detected on both strands. 

The above procedure permits those amplification products, which contain biallelic 
markers to be identified. The detection limit for the frequency of biallelic polymorphisms 
detected by sequencing pools of 100 individuals is approximately 0.1 for the minor allele, as 
verified by sequencing pools of known allelic frequencies. However, more than 90% of the 
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biallelic polymorphisms detected by the pooling method have a frequency for the minor allele 
higher than 0.25. Therefore, the biallelic markers selected by this method have a frequency of 
at least 0. 1 for the minor allele and less than 0.9 for the major allele. Preferably at least 0.2 for 
the minor allele and less than 0.8 for the major allele, more preferably at least 0.3 for the minor 
allele and less than 0.7 for the major allele, thus a heterozygosity rate higher than 0. 1 8, 
preferably higher than 0.32, more preferably higher than 0.42. 

In another embodiment, biallelic markers are detected by sequencing individual DNA 
samples, the frequency of the minor allele of such a biallelic marker may be less than 0.1. 

Validation of the biallelic markers of the present invention 

The polymorphisms are evaluated for their usefulness as genetic markers by validating 
that both alleles are present in a population. Validation of the biallelic markers is accomplished 
by genotyping a group of individuals by a method of the invention and demonstrating that both 
alleles are present. Microsequencing is a preferred method of genotyping alleles. The 
validation by genotyping step may be performed on individual samples derived from each 
individual in the group or by genotyping a pooled sample derived from more than one 
individual. The group can be as small as one individual if that individual is heterozygous for 
the allele in question. Preferably the group contains at least three individuals, more preferably 
the group contains five or six individuals, so that a single validation test will be more likely to 
result in the validation of more of the biallelic markers that are being tested. It should be noted, 
however, that when the validation test is performed on a small group it may result in a false 
negative result if as a result of sampling error none of the individuals tested carries one of the 
two alleles. Thus, the validation process is less useful in demonstrating that a particular initial 
result is an artifact, than it is at demonstrating that there is a bona fide biallelic marker at a 
particular position in a sequence. All of the genotyping, haplotyping, association, and 
interaction study methods of the invention may optionally be performed solely with validated 
biallelic markers. 

Evaluation of the frequency of the biallelic markers of the present invention 

The validated biallelic markers are further evaluated for their usefulness as genetic 
markers by determining the frequency of the least common allele at the biallelic marker site. 
The determination of the least common allele is accomplished by genotyping a group of 
individuals by a method of the invention and demonstrating that both alleles are present. This 
determination of frequency by genotyping step may be performed on individual samples derived 
from each individual in the group or by genotyping a pooled sample derived from more than 
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one individual. The group must be large enough to be representative of the population as a 
whole. Preferably the group contains at least 20 individuals, more preferably the group contains 
at least 50 individuals, most preferably the group contains at least 100 individuals. Of course 
the larger the group the greater the accuracy of the frequency determination because of reduced 
sampling error. A biallelic marker wherein the frequency of the less common allele is 30% or 
more is termed a "high quality biallelic marker." All of the genotyping, haplotyping, 
association, and interaction study methods of the invention may optionally be performed solely 
with high quality biallelic markers. 

Another embodiment of the invention comprises methods of estimating the frequency 
of an allele in a population comprising genotyping individuals from said population for a 
13q3 l-q33-related biallelic marker and determining the proportional representation of said 
biallelic marker in said population. In addition, the methods of estimating the frequency of an 
allele in a population encompass methods with any further limitation described in this 
disclosure, or those following, specified alone or in any combination: Optionally, said 13q31- 
q33-related biallelic marker may be in a sequence selected individually or in any combination 
from the group consisting of SEQ Nos 1 to 17, 24, 25, 28, 74 and 79 to 132; and the 
complements thereof; optionally, said 13q31-q33-related biallelic marker may be selected from 
the biallelic markers described in Table 5; optionally, determining the frequency of a biallelic 
marker allele in a population may be accomplished by determining the identity of the 
nucleotides for both copies of said biallelic marker present in the genome of each individual in 
said population and calculating the proportional representation of said nucleotide at said 13q3 1- 
q33-related biallelic marker for the population; optionally, determining the frequency of a 
biallelic marker allele in a population may be accomplished by performing a genotyping method 
on a pooled biological sample derived from a representative number of individuals, or each 
individual, in said population, and calculating the proportional amount of said nucleotide 
compared with the total. 

Methods Of Genotyping An Individual For Biallelic Markers 
Methods are provided to genotype a biological sample for one or more biallelic markers 
of the present invention, all of which may be performed in vitro. Such methods of genotyping 
comprise determining the identity of a nucleotide at an biallelic marker of the invention by any 
method known in the art. These methods find use in genotyping case-control populations in 
association studies as well as individuals in the context of detection of alleles of biallelic 
markers which, are known to be associated with a given trait, in which case both copies of the 
biallelic marker present in individual's genome are determined so that an individual may be 
classified as homozygous or heterozygous for a particular allele. 
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These genotyping methods can be performed nucleic acid samples derived from a single 
individual or pooled DNA samples. 

Genotyping can be performed using similar methods as those described above for the 
identification of the biallelic markers, or using other genotyping methods such as those further 
described below. In preferred embodiments, the comparison of sequences of amplified genomic 
fragments from different individuals is used to identify new biallelic markers whereas 
microsequencing is used for genotyping known biallelic markers in diagnostic and association 
study applications. 

Another embodiment of the invention encompasses methods of genotyping a biological 
sample comprising determining the identity of a nucleotide at a 1 3q3 l-q33-related biallelic 
marker. Thus, the invention encompasses a method of genotyping an individual comprising: (a) 
obtaining a biological sample comprising a nucleic acid from said individual; (b) determining 
the identity of a polymorphic base at a biallelic marker from said nucleic acid; wherein said 
polymorphic base is selected from the group consisting of the first or second respective allele at 
the nucleotide position of the polymorphic base of SEQ ID No 1, 24, 25, 28, or 74 as indicated 
in table 5; wherein the identity of the polymorphic base determines the genotype of the 
individual at said position. 

In addition, the genotyping methods of the invention encompass methods with any 
further limitation described in this disclosure, or those following, specified alone or in any 
combination: Optionally, said 13q3 1 -q33-related biallelic marker may be in a sequence 
selected individually or in any combination from the group consisting of SEQ ID Nos 1 to 17, 
24, 25, 28, 74 and 79 to 132, and the complements thereof; optionally, said 13q3 l-q33-related 
biallelic marker may be selected individually or in any combination from the biallelic markers 
described in Table 5; optionally, said method further comprises determining the identity of a 
25 second nucleotide at said biallelic marker, wherein said first nucleotide and second nucleotide 

are not base paired (by Watson & Crick base pairing) to one another; optionally, said biological 
sample is derived from a single individual or subject; optionally, said method is performed in 
vitro; optionally, said biallelic marker is determined for both copies of said biallelic marker 
present in said individual's genome; optionally, said biological sample is derived from multiple 
subjects or individuals; optionally, said method further comprises amplifying a portion of said 
sequence comprising the biallelic marker prior to said determining step; optionally, wherein 
said amplifying is performed by PCR, LCR, or replication of a recombinant vector comprising 
an origin of replication and said portion in a host cell; optionally, wherein said determining is 
performed by a hybridization assay, sequencing assay, microsequencing assay, or an enzyme- 
35 based mismatch detection assay. 
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Source of DNA for genotyping 

Any source of nucleic acids, in purified or non-purified form, can be utilized as the 
starting nucleic acid, provided it contains or is suspected of containing the specific nucleic acid 
sequence desired. DNA or RNA may be extracted from cells, tissues, body fluids and the like 
as described herein. While nucleic acids for use in the genotyping methods of the invention can 
be derived from any mammalian source, the test subjects and individuals from which nucleic 
acid samples are taken are generally understood to be human. 

Amplification Of DNA Fragments Comprising Biallelic Markers 
Methods and polynucleotides are provided to amplify a segment of nucleotides 
comprising one or more biallelic marker of the present invention. It will be appreciated that 
amplification of DNA fragments comprising biallelic markers may be used in various methods 
and for various purposes and is not restricted to genotyping. Nevertheless, many genotyping 
methods, although not all, require the previous amplification of the DNA region carrying the 
biallelic marker of interest Such methods specifically increase the concentration or total 
number of sequences that span the biallelic marker or include that site and sequences located 
either distal or proximal to it. Diagnostic assays may also rely on amplification of DNA 
segments carrying a biallelic marker of the present invention. 

Amplification of DNA may be achieved by any method known in the art. The 
established PCR (polymerase chain reaction) method or by developments thereof or 
alternatives. Amplification methods which can be utilized herein include but are not limited to 
Ligase Chain Reaction (LCR) as described in EP A 320 308 and EP A 439 182, Gap LCR 
(Wolcott, M.J.), the so-called "NASBA" or "3SR" technique described in Guatelli J.C. et al. 
(1990) and in Compton J. (1991), Q-beta amplification as described in EP A 4544 610, strand 
displacement amplification as described in Walker et al. (1996) and EP A 684 3 15 and, target 
mediated amplification as described in PCT Publication WO 9322461. 

LCR and Gap LCR are exponential amplification techniques, both depend on DNA 
ligase to join adjacent primers annealed to a DNA molecule. In Ligase Chain Reaction (LCR), 
probe pairs are used which include two primary (first and second) and two secondary (third and 
fourth) probes, all of which are employed in molar excess to target. The first probe hybridizes 
to a first segment of the target strand and the second probe hybridizes to a second segment of 
the target strand, the first and second segments being contiguous so that the primary probes abut 
one another in 5' phosphate-3'hydroxyl relationship, and so that a ligase can covalently fuse or 
ligate the two probes into a fused product. In addition, a third (secondary) probe can hybridize 
to a portion of the first probe and a fourth (secondary) probe can hybridize to a portion of the 
second probe in a similar abutting fashion. Of course, if the target is initially double stranded, 
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the secondary probes also will hybridize to the target complement in the first instance. Once the 
ligated strand of primary probes is separated from the target strand, it will hybridize with the 
third and fourth probes which can be ligated to form a complementary, secondary ligated 
product. It is important to realize that the ligated products are functionally equivalent to either 
the target or its complement. By repeated cycles of hybridization and ligation, amplification of 
the target sequence is achieved. A method for multiplex LCR has also been described (WO 
9320227). Gap LCR (GLCR) is a version of LCR where the probes are not adjacent but are 
separated by 2 to 3 bases. 

For amplification of mRNAs, it is within the scope of the present invention to reverse 
transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 
single enzyme for both steps as described in U.S. Patent No. 5,322,770 or, to use Asymmetric 
Gap LCR (RT-AGLCR) as described by Marshall R.L. et al. (1994). AGLCR is a modification 
of GLCR that allows the amplification of RNA. 

Some of these amplification methods are particularly suited for the detection of single 
nucleotide polymorphisms and allow the simultaneous amplification of a target sequence and 
the identification of the polymorphic nucleotide as it is further described herein. 

The PCR technology is the preferred amplification technique used in the present 
invention. A variety of PCR techniques are familiar to those skilled in the art. For a review of 
PCR technology, see Molecular Cloning to Genetic Engineering White, B.A. Ed. (1997) and the 
publication entitled "PCR Methods and Applications" (1991, Cold Spring Harbor Laboratory 
Press). In each of these PCR procedures, PCR primers on either side of the nucleic acid 
sequences to be amplified are added to a suitably prepared nucleic acid sample along with 
dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent 
polymerase. The nucleic acid in the sample is denatured and the PCR primers are specifically 
hybridized to complementary nucleic acid sequences in the sample. The hybridized primers are 
extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. 
The cycles are repeated multiple times to produce an amplified fragment containing the nucleic 
acid sequence between the primer sites. PCR has fiirther been described in several patents 
including US Patents 4,683,195, 4,683,202 and 4,965,188. 

Primers can be prepared by any suitable method. As for example, direct chemical 
synthesis by a method such as the phosphodiester method of Narang S.A. et al. (1979), the 
phosphodiester method of Brown E.L. et al. (1979), the diethylphosphoramidite method of 
Beaucage et al. (1981) and the solid support method described in EP 0 707 592. 

In some embodiments the present invention provides primers for amplifying a DNA 
fragment containing one or more biallelic markers of the present invention. It will be 
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appreciated that the primers listed are merely exemplary and that any other set of primers which 
produce amplification products containing one or more biallelic markers of the present 

invention. 

The spacing of the primers determines the length of the segment to be amplified. In the 
context of the present invention amplified segments carrying biallelic markers can range in size 
from at least about 25 bp to 35 kbp. Amplification fragments from 25-3000 bp are typical, 
fragments from 50-1000 bp are preferred and fragments from 100-600 bp are highly preferred. 
It will be appreciated that amplification primers for the biallelic markers may be any sequence 
which allow the specific amplification of any DNA fragment carrying the markers. 
Amplification primers may be labeled or immobilized on a solid support as described in the 
section titled "Oligonucleotide Probes and Primers". 

Methods of Genotvping DNA samples for BUUdlc Markers 
Any method known in the art can be used to identify the nucleotide present at a biallelic 
marker site. Since the biallelic marker allele to be detected has been identified and specified in 
the present invention, detection will prove simple for one of ordinary skill in the art by 
employing any of a number of techniques. Many genotyping methods require the previous 
amplification of the DNA region carrying the biallelic marker of interest. While the 
amplification of target or signal is often preferred at present, ultrasensitive detection methods 
which do not require amplification are also encompassed by the present genotyping methods. 
Methods well-known to those skilled in the art that can be used to detect biallelic 
polymorphisms include methods such as, conventional dot blot analyzes, single strand 
conformational polymorphism analysis (SSCP) described by Orita et al. (1989), denaturing 
gradient gel electrophoresis (DGGE), heteroduplex analysis, mismatch cleavage detection, and 
other conventional techniques as described in Sheffield, V.C. et al. (1991), White et al. (1992), 
Grompe, M. et al. ( 1 989) and Grompe, M. ( 1 993). Another method for determining the identity 
of the nucleotide present at a particular polymorphic site employs a specialized exonuclease- 
resistant nucleotide derivative as described in US patent 4,656, 127. 

Preferred methods involve directly determining the identity of the nucleotide present at 
a biallelic marker site by sequencing assay, enzyme-based mismatch detection assay, or 
hybridization assay. The following is a description of some preferred methods. A highly 
preferred method is the microsequencing technique. The term "sequencing assay" is used 
herein to refer to polymerase extension of duplex primer/template complexes and includes both 
traditional sequencing and microsequencing. 
1) Sequencing assays 

The nucleotide present at a polymorphic site can be determined by sequencing methods. 
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In a preferred embodiment, DN A samples are subjected to PCR amplification before 
sequencing as described above. DNA sequencing methods are described in herein. Preferably, 
the amplified DNA is subjected to automated dideoxy terminator sequencing reactions using a 
dye-primer cycle sequencing protocol. Sequence analysis allows the identification of the base 
present at the biallelic marker site. 

2) Microsequencing assays 

In microsequencing methods, a nucleotide at the polymorphic site that is unique to one 
of the alleles in a target DNA is detected by a single nucleotide primer extension reaction. This 
method involves appropriate microsequencing primers which, hybridize just upstream of a 
polymorphic base of interest in the target nucleic acid. A polymerase is used to specifically 
extend the 3* end of the primer with one single ddNTP (chain terminator) complementary to the 
selected nucleotide at the polymorphic site. Next the identity of the incorporated nucleotide is 
determined in any suitable way. 

Typically, microsequencing reactions are carried out using fluorescent ddNTPs and the 
extended microsequencing primers are analyzed by electrophoresis on ABI 377 sequencing 
machines to determine the identity of the incorporated nucleotide as described in EP 412 883. 
Alternatively capillary electrophoresis can be used in order to process a higher number of assays 
simultaneously. An example of a typical microsequencing procedure that can be used in the 
context of the present invention is provided in example 4. 

Different approaches can be used to detect the nucleotide added to the microsequencing 
primer. A homogeneous phase detection method based on fluorescence resonance energy 
transfer has been described by Chen and Kwok (1997) and Chen et ai. (1997). In this method 
amplified genomic DNA fragments containing polymorphic sites are incubated with a 5'- 
fluorescein-labeled primer in the presence of allelic dye-labeled dideoxyribonucleoside 
triphosphates and a modified Taq polymerase. The dye-labeled primer is extended one base by 
the dye-terminator specific for the allele present on the template. At the end of the genotyping 
reaction, the fluorescence intensities of the two dyes in the reaction mixture are analyzed 
directly without separation or purification. All these steps can be performed in the same tube 
and the fluorescence changes can be monitored in real time. Alternatively, the extended primer 
may be analyzed by MALDI-TOF Mass Spectrometry. The base at the polymorphic site is 
identified by the mass added onto the microsequencing primer (see HafTL.A. and Smirnov LP., 
1997). 

Microsequencing may be achieved by the established microsequencing method or by 
developments or derivatives thereof. Alternative methods include several solid-phase 
microsequencing techniques. The basic microsequencing protocol is the same as described 
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previously, except that the method is conducted as a heterogenous phase assay, in which the 
primer or the target molecule is immobilized or captured onto a solid support. To simplify the 
primer separation and the terminal nucleotide addition analysis, oligonucleotides are attached to 
solid supports or are modified in such ways that permit affinity separation as well as polymerase 
extension. The 5 1 ends and internal nucleotides of synthetic oligonucleotides can be modified in 
a number of different ways to permit different affinity separation approaches, e.g., biotinylation. 
If a single affinity group is used on the oligonucleotides, the oligonucleotides can be separated 
from the incorporated terminator regent. This eliminates the need of physical or size separation. 
More than one oligonucleotide can be separated from the terminator reagent and analyzed 
simultaneously if more than one affinity group is used. This permits the analysis of several 
nucleic acid species or more nucleic acid sequence information per extension reaction. The 
affinity group need not be on the priming oligonucleotide but could alternatively be present on 
the template. For example, immobilization can be carried out via an interaction between 
biotinylated DNA and streptavidin-coated microtitration wells or avidin-coated polystyrene 
particles. In the same manner oligonucleotides or templates may be attached to a solid support 
in a high-density format. In such solid phase microsequencing reactions, incorporated ddNTPs 
can be radiolabeled (Syvanen, 1994) or linked to fluorescein (Livak and Hainer, 1994). The 
detection of radiolabeled ddNTPs can be achieved through scintillation-based techniques. The 
detection of fluorescein-linked ddNTPs can be based on the binding of antifluorescein antibody 
conjugated with alkaline phosphatase, followed by incubation with a chromogenic substrate 
(such as/7-nitrophenyI phosphate). Other possible reporter-detection pairs include: ddNTP 
linked to dinitrophenyl (DNP) and anti-DNP alkaline phosphatase conjugate (Harju et al., 1993) 
or biotinylated ddNTP and horseradish peroxidase-conjugated streptavidin with o- 
phenylenediamine as a substrate (WO 92/15712). As yet another alternative solid-phase 
microsequencing procedure, Nyren et al. (1993) described a method relying on the detection of 
DNA polymerase activity by an enzymatic luminometric inorganic pyrophosphate detection 
assay (ELIDA). 

Pastinen et al. (1997), describe a method for multiplex detection of single nucleotide 
polymorphism in which the solid phase minisequencing principle is applied to an 
oligonucleotide array format. High-density arrays of DNA probes attached to a solid support 
(DNA chips) are further described in herein. 

In one aspect the present invention provides polynucleotides and methods to genotype 
one or more biallelic markers of the present invention by performing a microsequencing assay. 
Preferred microsequencing primers include those being featured Table 6. It will be appreciated 
that the microsequencing primers listed in Table 6 are merely exemplary and that, any primer 
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having a 3' end immediately adjacent to a polymorphic nucleotide may be used. Similarly, it 
will be appreciated that microsequencing analysis may be performed for any biallelic marker or 
any combination of biallelic markers of the present invention. One aspect of the present 
invention is a solid support which includes one or more microsequencing primers listed in Table 
6, or fragments comprising at least 8, at least 12, at least 1 5, or at least 20 consecutive 
nucleotides thereof and having a 3 r terminus immediately upstream of the corresponding 
biallelic marker, for determining the identity of a nucleotide at biallelic marker site. 
3) Mismatch detection assays based on polymerases and ligases 
In one aspect the present invention provides polynucleotides and methods to determine 
the allele of one or more biallelic markers of the present invention in a biological sample, by 
mismatch detection assays based on polymerases and/or ligases. These assays are based on the 
specificity of polymerases and ligases. Polymerization reactions places particularly stringent 
requirements on correct base pairing of the 3' end of the amplification primer and the joining of 
two oligonucleotides hybridized to a target DNA sequence is quite sensitive to mismatches 
close to the ligation site, especially at the 3' end. The terms "enzyme based mismatch detection 
assay" are used herein to refer to any method of determining the allele of a biallelic marker 
based on the specificity of ligases and polymerases. Preferred methods are described below. 
Methods, primers and various parameters to amplify DNA fragments comprising biallelic 
markers of the present invention are farther described herein. 
Allele specific amplification 

Discrimination between the two alleles of a biallelic marker can also be achieved by 
allele specific amplification, a selective strategy, whereby one of the alleles is amplified without 
amplification of the other allele. This is accomplished by placing a polymorphic base at the 3* 
end of one of the amplification primers. Because the extension forms from the 3'end of the 
primer, a mismatch at or near this position has an inhibitory effect on amplification. Therefore, 
under appropriate amplification conditions, these primers only direct amplification on their 
complementary allele. Designing the appropriate allele-specific primer and the corresponding 
assay conditions are well with the ordinary skill in the art. 

Ligation/amplification based methods 

The "Oligonucleotide Ligation Assay" (OLA) uses two oligonucleotides which are 
designed to be capable of hybridizing to abutting sequences of a single strand of a target 
molecules. One of the oligonucleotides is biotinylated, and the other is detectably labeled. If 
the precise complementary sequence is found in a target molecule, the oligonucleotides will 
hybridize such that their termini abut, and create a ligation substrate that can be captured and 
detected. OLA is capable of detecting biallelic markers and may be advantageously combined 
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with PCR as described by Nickerson D.A. et al. (1990). In this method, PCR is used to achieve 
the exponential amplification of target DNA, which is then detected using OLA. 

Other methods which are particularly suited for the detection of biallelic markers 
include LCR (ligase chain reaction), Gap LCR (GLCR) which are described herein. As 
mentioned above LCR uses two pairs of probes to exponentially amplify a specific target. The 
sequences of each pair of oligonucleotides, is selected to permit the pair to hybridize to abutting 
sequences of the same strand of the target. Such hybridization forms a substrate for a template- 
dependant ligase. In accordance with the present invention, LCR can be performed with 
oligonucleotides having the proximal and distal sequences of the same strand of a biallelic 
marker site. In one embodiment, either oligonucleotide will be designed to include the biallelic 
marker site. In such an embodiment, the reaction conditions are selected such that the 
oligonucleotides can be ligated together only if the target molecule either contains or lacks the 
specific nucleotide(s) that is complementary to the biallelic marker on the oligonucleotide. In 
an alternative embodiment, the oligonucleotides will not include the biallelic marker, such that 
when they hybridize to the target molecule, a "gap" is created as described in WO 90/0 1 069. 
his gap is then "filled" with complementary dNTPs (as mediated by DNA polymerase), or by an 
additional pair of oligonucleotides. Thus at the end of each cycle, each single strand has a 
complement capable of serving as a target during the next cycle and exponential allele-speciflc 
amplification of the desired sequence is obtained. 

Ligase/Polymerase-mediated Genetic Bit Analysis™ is another method for determining 
the identity of a nucleotide at a preselected site in a nucleic acid molecule (WO 95/21271). This 
method involves the incorporation of a nucleoside triphosphate that is complementary to the 
nucleotide present at the preselected site onto the terminus of a primer molecule, and their 
subsequent ligation to a second oligonucleotide. The reaction is monitored by detecting a 
specific label attached to the reaction's solid phase or by detection in solution. 

4) Hybridization assay methods 

A preferred method of determining the identity of the nucleotide present at a biallelic 
marker site involves nucleic acid hybridization. The hybridization probes, which can be 
conveniently used in such reactions, preferably include the probes defined herein. Any 
hybridization assay may be used including Southern hybridization, Northern hybridization, dot 
blot hybridization and solid-phase hybridization (see Sambrook et al., Molecular Cloning - A 
Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., 1989). 

Hybridization refers to the formation of a duplex structure by two single stranded 
nucleic acids due to complementary base pairing. Hybridization can occur between exactly 
complementary nucleic acid strands or between nucleic acid strands that contain minor regions 
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of mismatch. Specific probes can be designed that hybridize to one form of a biallelic marker 
and not to the other and therefore are able to discriminate between different allelic forms. 
Allele-speciflc probes are often used in pairs, one member of a pair showing perfect match to a 
target sequence containing the original allele and the other showing a perfect match to the target 
sequence containing the alternative allele. Hybridization conditions should be sufficiently 
stringent that there is a significant difference in hybridization intensity between alleles, and 
preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles. 
Stringent, sequence specific hybridization conditions, under which a probe will hybridize only 
to the exactly complementary target sequence are well known in the art (Sambrook et al., 
Molecular Cloning - A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., 
1989). Stringent conditions are sequence dependent and will be different in different 
circumstances. Generally, stringent conditions are selected to be about 5°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. By 
way of example and not limitation, procedures using conditions of high stringency are as 
follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65°C in 
buffer composed of 6X SSC, 50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 ng/ml denatured salmon sperm DNA. Filters are hybridized for 
48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 
100 pg/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. 
Alternatively, the hybridization step can be performed at 65°C in the presence of SSC buffer, 1 
x SSC corresponding to 0. 1 5M NaCI and 0.05 M Na citrate. Subsequently, filter washes can be 
done at 37°C for 1 h in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% 
BSA, followed by a wash in 0. 1 X SSC at 50°C for 45 min. Alternatively, filter washes can be 
performed in a solution containing 2 x SSC and 0. 1% SDS, or 0.5 x SSC and 0.1% SDS, or 0. 1 
x SSC and 0.1% SDS at 68°C for 15 minute intervals. Following the wash steps, the hybridized 
probes are detectable by autoradiography. By way of example and not limitation, procedures 
using conditions of intermediate stringency are as follows: Filters containing DNA are 
prehybridized, and then hybridized at a temperature of 60°C in the presence of a 5 x SSC buffer 
and labeled probe. Subsequently, filters washes are performed in a solution containing 2x SSC 
at 50°C and the hybridized probes are detectable by autoradiography. Other conditions of high 
and intermediate stringency which may be used are well known in the art and as cited in 
Sambrook et al. (Molecular Cloning - A Laboratory Manual, Second Edition, Cold Spring 
Harbor Press, N.Y., 1989) and Ausubel et al. (Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y., 1989). 

Although such hybridizations can be performed in solution, it is preferred to employ a 
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solid-phase hybridization assay. The target DNA comprising a biallelic marker of the present 
invention may be amplified prior to the hybridization reaction. The presence of a specific allele 
in the sample is determined by detecting the presence or the absence of stable hybrid duplexes 
formed between the probe and the target DNA. The detection of hybrid duplexes can be carried 
out by a number of methods. Various detection assay formats are well known which utilize 
detectable labels bound to either the target or the probe to enable detection of the hybrid 
duplexes. Typically, hybridization duplexes are separated from unhybridized nucleic acids and 
the labels bound to the duplexes are then detected. Those skilled in the art will recognize that 
wash steps may be employed to wash away excess target DNA or probe. Standard 
heterogeneous assay formats are suitable for detecting the hybrids using the labels present on 
the primers and probes. 

Two recently developed assays allow hybridization-based allele discrimination with no 
need for separations or washes (see Landegren U. et ah, 1 998). The TaqMan assay takes 
advantage of the 5' nuclease activity of Taq DNA polymerase to digest a DNA probe annealed 
specifically to the accumulating amplification product. TaqMan probes are labeled with a 
donor-acceptor dye pair that interacts via fluorescence energy transfer. Cleavage of the 
TaqMan probe by the advancing polymerase during amplification dissociates the donor dye 
from the quenching acceptor dye, greatly increasing the donor fluorescence. All reagents 
necessary to detect two allelic variants can be assembled at the beginning of the reaction and the 
results are monitored in real time (see Livak et al, 1995). In an alternative homogeneous 
hybridization-based procedure, molecular beacons are used for allele discriminations. 
Molecular beacons are hairpin-shaped oligonucleotide probes that report the presence of 
specific nucleic acids in homogeneous solutions. When they bind to their targets they undergo a 
conformational reorganization that restores the fluorescence of an internally quenched 
fluorophore (Tyagi et al., 1998). 

By assaying the hybridization to an allele specific probe, one can detect the presence or 
absence of a biallelic marker allele in a given sample. 

High-Throughput parallel hybridizations in array format are specifically encompassed 
within "hybridization assays'* and are described below. 

Hybridization to addressable arrays of oligonucleotides 
Hybridization assays based on oligonucleotide arrays rely on the differences in 
hybridization stability of short oligonucleotides to perfectly matched and mismatched target 
sequence variants. Efficient access to polymorphism information is obtained through a basic 
structure comprising high-density arrays of oligonucleotide probes attached to a solid support 
(the chip) at selected positions. Each DNA chip can contain thousands to millions of individual 
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synthetic DNA probes arranged in a grid-like pattern and miniaturized to the size of a dime. 

The chip technology has already been applied with success in numerous cases. For 
example, the screening of mutations has been undertaken in the BRCA1 gene, in S. cerevisiae 
mutant strains, and in the protease gene of H1V-1 virus (Hacia et al., 1996; Shoemaker et a!, 
1996 ; Kozal et al., 1996). Chips of various formats for use in detecting biallelic 
polymorphisms can be produced on a customized basis by Affymetrix (GeneChip™), Hyseq 
(HyChip and HyGnostics), and Protogene Laboratories. 

In general, these methods employ arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual which, target 
sequences include a polymorphic marker. EP785280, describes a tiling strategy for the 
detection of single nucleotide polymorphisms. Briefly, arrays may generally be "tiled" for a 
large number of specific polymorphisms. By filing" is generally meant the synthesis of a 
defined set of oligonucleotide probes which is made up of a sequence complementary to the 
target sequence of interest, as well as preselected variations of that sequence, e.g., substitution 
of one or more given positions with one or more members of the basis set of monomers, i.e. 
nucleotides. Tiling strategies are further described in PCT application No. WO 95/1 1995. In a 
particular aspect, arrays are tiled for a number of specific, identified biallelic marker sequences. 
In particular the array is tiled to include a number of detection blocks, each detection block 
being specific for a specific biallelic marker or a set of biallelic markers. For example, a 
detection block may be tiled to include a number of probes, which span the sequence segment 
that includes a specific polymorphism. To ensure probes that are complementary to each allele, 
the probes are synthesized in pairs differing at the biallelic marker. In addition to the probes 
differing at the polymorphic base, monosubstituted probes are also generally tiled within the 
detection block. These monosubstituted probes have bases at and up to a certain number of 
bases in either direction from the polymorphism, substituted with the remaining nucleotides 
(selected from A, T, G, C and U). Typically the probes in a tiled detection block will include 
substitutions of the sequence positions up to and including those that are 5 bases away from the 
biallelic marker. The monosubstituted probes provide internal controls for the tiled array, to 
distinguish actual hybridization from artefactual cross-hybridization. Upon completion of 
hybridization with the target sequence and washing of the array, the array is scanned to 
determine the position on the array to which the target sequence hybridizes. The hybridization 
data from the scanned array is then analyzed to identify which allele or alleles of the biallelic 
marker are present in the sample. Hybridization and scanning may be carried out as described 
in PCT application No. WO 92/10092 and WO 95/1 1995 and US patent No. 5,424,186. 

Thus, in some embodiments, the chips may comprise an array of nucleic acid sequences 
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of fragments of about 15 nucleotides in length. In further embodiments, the chip may comprise 
an array including at least one of the sequences selected from the group consisting of SEQ ID 
Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 and the sequences complementary thereto, or a 
fragment thereof at least about 8 consecutive nucleotides, preferably 10, 1 5, 20, more preferably 
25, 30, 40, 47, or 50 consecutive nucleotides. In some embodiments, the chip may comprise an 
array of at least 2, 3, 4, 5, 6, 7, 8 or more of these polynucleotides of the invention. Solid 
supports and polynucleotides of the present invention attached to solid supports are further 
described in the section titled "Oligonucleotide probes and Primers". 
5) Integrated Systems 

Another technique, which may be used to analyze polymorphisms, includes 
multicomponent integrated systems, which miniaturize and compartmentalize processes such as 
PCR and capillary electrophoresis reactions in a single functional device. An example of such 
technique is disclosed in US patent 5,589,136, which describes the integration of PCR 
amplification and capillary electrophoresis in chips. 

Integrated systems can be envisaged mainly when microfluidic systems are used. These 
systems comprise a pattern of microchannels designed onto a glass, silicon, quartz, or plastic 
wafer included on a microchip. The movements of the samples are controlled by electric, 
electroosmotic or hydrostatic forces applied across different areas of the microchip. For 
genotyping biallelic markers, the microfluidic system may integrate nucleic acid amplification, 
microsequencing, capillary electrophoresis and a detection method such as laser-induced 
fluorescence detection. 



Methods Of G enetic Analysis Using The Biallelic Markers Of The Present 
Invention 

Different methods are available for the genetic analysis of complex traits (see Lander 
and Schork, 1994). The search for disease-susceptibility genes is conducted using two main 
methods: the linkage approach in which evidence is sought for cosegregation between a locus 
and a putative trait locus using family studies, and the association approach in which evidence is 
sought for a statistically significant association between an allele and a trait or a trait causing 
allele (Khoury J. etal, 1993). In general, the biallelic markers of the present invention find use 
in any method known in the art to demonstrate a statistically significant correlation between a 
genotype and a phenotype. The biallelic markers may be used in parametric and non-parametric 
linkage analysis methods. Preferably, the biallelic markers of the present invention are used to 
identify genes associated with detectable traits using association studies, an approach which 
does not require the use of affected families and which permits the identification of genes 
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associated with complex and sporadic traits. 

The genetic analysis using the biallelic markers of the present invention may be 
conducted on any scale. The whole set of biallelic markers of the present invention or any 
subset of biallelic markers of the present invention may be used. In some embodiments a subset 
of biallelic markers corresponding to one or several candidate genes of the present invention 
may be used. Alternatively, a subset of biallelic markers of the present invention localised on a 
specific chromosome segment may be used. Further, any set of genetic markers including a 
biallelic marker of the present invention may be used. As mentioned above, it should be noted 
that the biallelic markers of the present invention may be included in any complete or partial 
genetic map of the human genome. These different uses are specifically contemplated in the 
present invention and claims. 

Linkage analysis 

Linkage analysis is based upon establishing a correlation between the transmission of 
genetic markers and that of a specific trait throughout generations within a family. Thus, the 
aim of linkage analysis is to detect marker loci that show cosegregation with a trait of interest in 
pedigrees. 

Parametric methods 

When data are available from successive generations there is the opportunity to study 
the degree of linkage between pairs of loci. Estimates of the recombination fraction enable loci 
to be ordered and placed onto a genetic map. With loci that are genetic markers, a genetic map 
can be established, and then the strength of linkage between markers and traits can be calculated 
and used to indicate the relative positions of markers and genes affecting those traits (Weir, 
B.S., 1996). The classical method for linkage analysis is the logarithm of odds (lod) score 
method (see Morton N.E., 1955; Ott J, 1991). Calculation of lod scores requires specification of 
the mode of inheritance for the disease (parametric method). Generally, the length of the 
candidate region identified using linkage analysis is between 2 and 20Mb. Once a candidate 
region is identified as described above, analysis of recombinant individuals using additional 
markers allows further delineation of the candidate region. Linkage analysis studies have 
generally relied on the use of a maximum of 5,000 microsatellite markers, thus limiting the 
maximum theoretical attainable resolution of linkage analysis to about 600 kb on average. 

Linkage analysis has been successfully applied to map simple genetic traits that show 
clear Mendelian inheritance patterns and which have a high penetrance (i.e., the ratio between 
the number of trait positive carriers of allele a and the total number of a carriers in the 
population). However, parametric linkage analysis suffers from a variety of drawbacks. First, it 
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is limited by its reliance on the choice of a genetic model suitable for each studied trait. 
Furthermore, as already mentioned, the resolution attainable using linkage analysis is limited, 
and complementary studies are required to refine the analysis of the typical 2Mb to 20Mb 
regions initially identified through linkage analysis. In addition, parametric linkage analysis 
approaches have proven difficult when applied to complex genetic traits, such as those due to 
the combined action of multiple genes and/or environmental factors. It is very difficult to 
model these factors adequately in a lod score analysis. In such cases, too large an effort and 
cost are needed to recruit the adequate number of affected families required for applying linkage 
analysis to these situations, as recently discussed by Risch, N. and Merikangas, K. (1996). 
Non-parametric methods 

The advantage of the so-called non-parametric methods for linkage analysis is that they 
do not require specification of the mode of inheritance for the disease, they tend to be more 
useful for the analysis of complex traits. In non-parametric methods, one tries to prove that the 
inheritance pattern of a chromosomal region is not consistent with random Mendelian 
segregation by showing that affected relatives inherit identical copies of the region more often 
than expected by chance. Affected relatives should show excess "allele sharing" even in the 
presence of incomplete penetrance and polygenic inheritance. In non-parametric linkage 
analysis the degree of agreement at a marker locus in two individuals can be measured either by 
the number of alleles identical by state (IBS) or by the number of alleles identical by descent 
(IBD). Affected sib pair analysis is a well-known special case and is the simplest form of these 
methods. 

The biallelic markers of the present invention may be used in both parametric and non- 
parametric linkage analysis. Preferably biallelic markers may be used in non-parametric 
methods which allow the mapping of genes involved in complex traits. The biallelic markers of 
the present invention may be used in both IBD- and IBS- methods to map genes affecting a 
complex trait. In such studies, taking advantage of the high density of biallelic markers, several 
adjacent biallelic marker loci may be pooled to achieve the efficiency attained by multi-allelic 
markers (Zhao et al., 1998). 

However, both parametric and non-parametric linkage analysis methods analyse 
affected relatives, they tend to be of limited value in the genetic analysis of drug responses or in 
the analysis of side effects to treatments. This type of analysis is impractical in such cases due 
to the lack of availability of familial cases. In fact, the likelihood of having more than one 
individual in a family being exposed to the same drug at the same time is extremely low. 

Population Association Studies 

The present invention comprises methods for identifying one or several genes among a 
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set of candidate genes that are associated with a detectable trait using the biallelic markers of 
the present invention. In one embodiment the present invention comprises methods to detect an 
association between a biallelic marker allele or a biallelic marker haplotype and a trait. Further, 
the invention comprises methods to identify a trait causing allele in linkage disequilibrium with 
any biallelic marker allele of the present invention. 

As described above, alternative approaches can be employed to perform association 
studies: genome-wide association studies, candidate region association studies and candidate 
gene association studies. The candidate region analysis clearly provides a short-cut approach to 
the identification of genes and gene polymorphisms related to a particular trait when some 
information concerning the biology of the trait is available. Further, the biallelic markers of the 
present invention may be incorporated in any map of genetic markers of the human genome in 
order to perform genome-wide association studies. Methods to generate a high-density map of 
biallelic markers has been described in US Provisional Patent application serial number 
60/082,614. The biallelic markers of the present invention may further be incorporated in any 
map of a specific candidate region of the genome (a specific chromosome or a specific 
chromosomal segment for example). 

As mentioned above, association studies may be conducted within the general 
population and are not limited to studies performed on related individuals in affected families. 
Association studies are extremely valuable as they permit the analysis of sporadic or multifactor 
traits. Moreover, association studies represent a powerful method for fine-scale mapping 
enabling much finer mapping of trait causing alleles than linkage studies. Studies based on 
pedigrees often only narrow the location of the trait causing allele. Association studies using 
the biallelic markers of the present invention can therefore be used to refine the location of a 
trait causing allele in a candidate region identified by Linkage Analysis methods. Biallelic 
markers of the present invention can be used to identify the involved gene; such uses are 
specifically contemplated in the present invention and claims. 

1) Determining the frequency of a biallelic marker allele or of a biallelic marker 
haplotype in a population 

Another embodiment of the present invention encompasses methods of estimating the 
frequency of a haplotype for a set of biallelic markers in a population, comprising the steps of: 
a) genotyping each individual in said population for at least one 13q31-q33 -related biallelic 
marker, b) genotyping each individual in said population for a second biallelic marker by 
determining the identity of the nucleotides at said second biallelic marker for both copies of said 
second biallelic marker present in the genome; and c) applying a haplotype determination 
method to the identities of the nucleotides determined in steps a) and b) to obtain an estimate of 
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said frequency. In addition, the methods of estimating the frequency of a haplotype of the 
invention encompass methods with any further limitation described in this disclosure, or those 
following, specified alone or in any combination: optionally said haplotype determination 
method is selected from the group consisting of asymmetric PCR amplification, double PCR 
amplification of specific alleles, the Clark method, or an expectation maximization algorithm; 
optionally, said second biallelic marker is a 1 3q3 l-q33-related biallelic marker in a sequence 
selected from the group consisting of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32, and the 
complements thereof; optionally, said 13q3 l-q33-related biallelic marker may be selected 
individually or in any combination from the biallelic markers described in Tables 6b and 6c; 
optionally, the identity of the nucleotides at the biallelic markers in everyone of the sequences 
of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132 is determined in steps a) and b). 

Association studies explore the relationships among frequencies for sets of alleles 
between loci. 

Determining the frequency of an allele in a population 

Allelic frequencies of the biallelic markers in a population can be determined using one 
of the methods described above under the heading "Methods for genotyping an individual for 
biallelic markers", or any genotyping procedure suitable for this intended purpose. Genotyping 
pooled samples or individual samples can determine the frequency of a biallelic marker allele in 
a population. One way to reduce the number of genotypings required is to use pooled samples. 
A major obstacle in using pooled samples is in terms of accuracy and reproducibility for 
determining accurate DNA concentrations in setting up the pools. Genotyping individual 
samples provides higher sensitivity, reproducibility and accuracy and; is the preferred method 
used in the present invention. Preferably, each individual is genotyped separately and simple 
gene counting is applied to determine the frequency of an allele of a biallelic marker or of a 
genotype in a given population. 

Determining the frequency of a haplotype in a population 
The gametic phase of haplotypes is unknown when diploid individuals are 
heterozygous at more than one locus. Using genealogical information in families gametic phase 
can sometimes be inferred (Perlin et al., 1994). When no genealogical information is available 
different strategies may be used. One possibility is that the multiple-site heterozygous diploids 
can be eliminated from the analysis, keeping only the homozygotes and the single-site 
heterozygote individuals, but this approach might lead to a possible bias in the sample 
composition and the underestimation of low-frequency haplotypes. Another possibility is that 
single chromosomes can be studied independently, for example, by asymmetric PCR 
amplification (see Newton et al., 1989; Wu et al., 1989) or by isolation of single chromosome 
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by limit dilution followed by PCR amplification (see Ruano et al., 1990). Further, a sample may 
be haplotyped for sufficiently close biallelic markers by double PCR amplification of specific 
alleles (Sarkar, G. and Sommer S.S., 1 99 1 ). These approaches are not entirely satisfying either 
because of their technical complexity, the additional cost they entail, their lack of generalisation 
at a large scale, or the possible biases they introduce. To overcome these difficulties, an 
algorithm to infer the phase of PCR-amplified DNA genotypes introduced by Clark A.G. (1990) 
may be used. Briefly, the principle is to start filling a preliminary list of haplotypes present in 
the sample by examining unambiguous individuals, that is, the complete homozygotes and the 
single-site heterozygotes. Then other individuals in the same sample are screened for the 
possible occurrence of previously recognised haplotypes. For each positive identification, the 
complementary haplotype is added to the list of recognised haplotypes, until the phase 
information for all individuals is either resolved or identified as unresolved. This method 
assigns a single haplotype to each multiheterozygous individual, whereas several haplotypes are 
possible when there are more than one heterozygous site. Alternatively, one can use methods 
estimating haplotype frequencies in a population without assigning haplotypes to each 
individual. Preferably, a method based on an expectation-maximization (EM) algorithm 
(Dempster et al, J. R. 1977) leading to maximum-likelihood estimates of haplotype frequencies 
under the assumption of Hardy- Weinberg proportions (random mating) is used (see Excoffier L. 
and Slatkin M., 1995). The EM algorithm is a generalised iterative maximum-likelihood 
approach to estimation that is useful when data are ambiguous and/or incomplete. The EM 
algorithm is used to resolve heterozygotes into haplotypes. Haplotype estimations are further 
described below under the heading "Statistical methods». Any other method known in the art to 
determine or to estimate the frequency of a haplotype in a population may also be used. 
2) Linkage Disequilibrium analysis 

Linkage disequilibrium is the non-random association of alleles at two or more loci and 
represents a powerful tool for mapping genes involved in disease traits (see Ajioka R.S. et al., 
1 997). Biallelic markers, because they are densely spaced in the human genome and can be ' 
genotyped in more numerous numbers than other types of genetic markers (such as RFLP or 
VNTR markers), are particularly useful in genetic analysis based on linkage disequilibrium. 
The biallelic markers of the present invention may be used in any linkage disequilibrium 
analysis method known in the art. 

Briefly, when a disease mutation is first introduced into a population (by a new 
mutation or the immigration of a mutation carrier), it necessarily resides on a single 
chromosome and thus on a single "background" or "ancestral" haplotype of linked markers. 
Consequently, there is complete disequilibrium between these markers and the disease 
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mutation: one finds the disease mutation only in the presence of a specific set of marker alleles. 
Through subsequent generations recombinations occur between the disease mutation and these 
marker polymorphisms, and the disequilibrium gradually dissipates. The pace of this 
dissipation is a function of the recombination frequency, so the markers closest to the disease 
gene will manifest higher levels of disequilibrium than those that are further away. When not 
broken up by recombination, "ancestral" haplotypes and linkage disequilibrium between marker 
alleles at different loci can be tracked not only through pedigrees but also through populations. 
Linkage disequilibrium is usually seen as an association between one specific allele at one locus 
and another specific allele at a second locus. 

The pattern or curve of disequilibrium between disease and marker loci is expected to 
exhibit a maximum that occurs at the disease locus. Consequently, the amount of linkage 
disequilibrium between a disease allele and closely linked genetic markers may yield valuable 
information regarding the location of the disease gene. For fine-scale mapping of a disease 
locus, it is useful to have some knowledge of the patterns of linkage disequilibrium that exist 
between markers in the studied region. As mentioned above the mapping resolution achieved 
through the analysis of linkage disequilibrium is much higher than that of linkage studies. The 
high density of biallelic markers combined with linkage disequilibrium analysis provides 
powerful tools for fine-scale mapping. Different methods to calculate linkage disequilibrium 
are described below under the heading "Statistical Methods". 

3) Population-based case-control studies of trait-marker associations 
As mentioned above, the occurrence of pairs of specific alleles at different loci on the 
same chromosome is not random and the deviation from random is called linkage 
disequilibrium. Association studies focus on population frequencies and rely on the 
phenomenon of linkage disequilibrium. If a specific allele in a given gene is directly involved 
in causing a particular trait, its frequency will be statistically increased in an affected (trait 
positive) population, when compared to the frequency in a trait negative population or in a 
random control population. As a consequence of the existence of linkage disequilibrium, the 
frequency of all other alleles present in the haplotype carrying the trait-causing allele will also 
be increased in trait positive individuals compared to trait negative individuals or random 
controls. Therefore, association between the trait and any allele (specifically a biallelic marker 
allele) in linkage disequilibrium with the trait-causing allele will suffice to suggest the presence 
of a trait-related gene in that particular region. Case-control populations can be genotyped for 
biallelic markers to identify associations that narrowly locate a trait causing allele. As any 
marker in linkage disequilibrium with one given marker associated with a trait will be 
associated with the trait. Linkage disequilibrium allows the relative frequencies in case-control 
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populations of a limited number of genetic polymorphisms (specifically biallelic markers) to be 
analysed as an alternative to screening all possible functional polymorphisms in order to find 
trait-causing alleles. Association studies compare the frequency of marker alleles in unrelated 
case-control populations, and represent powerful tools for the dissection of complex traits. 
Case-control populations (inclusion criteria) 

Population-based association studies do not concern familial inheritance but compare 
the prevalence of a particular genetic marker, or a set of markers, in case-control populations. 
They are case-control studies based on comparison of unrelated case (affected or trait positive) 
individuals and unrelated control (unaffected or trait negative or random) individuals. 
Preferably the control group is composed of unaffected or trait negative individuals. Further, 
the control group is ethnically matched to the case population. Moreover, the control group is 
preferably matched to the case-population for the main known confusion factor for the trait 
under study (for example age-matched for an age-dependent trait). Ideally, individuals in the 
two samples are paired in such a way that they are expected to differ only in their disease status. 
In the following "trait positive population)), "case population" and "affected population" are 
used interchangeably. 

An important step in the dissection of complex traits using association studies is the 
choice of case-control populations (see Lander and Schork, 1 994). A major step in the choice 
of case-control populations is the clinical definition of a given trait or phenotype. Any genetic 
trait may be analysed by the association method proposed here by carefully selecting the 
individuals to be included in the trait positive and trait negative phenotypic groups. Four 
criteria are often useful: clinical phenotype, age at onset, family history and severity. The 
selection procedure for continuous or quantitative traits (such as blood pressure for example) 
involves selecting individuals at opposite ends of the phenotype distribution of the trait under 
study, so as to include in these trait positive and trait negative populations individuals with non- 
overlapping phenotypes. Preferably, case-control populations comprise phenotypically 
homogeneous populations. Trait positive and trait negative populations comprise 
phenotypically uniform populations of individuals representing each between 1 and 98%, 
preferably between 1 and 80%, more preferably between 1 and 50%, and more preferably 
between 1 and 30%, most preferably between 1 and 20% of the total population under study, 
and selected among individuals exhibiting non-overlapping phenotypes. The clearer the 
difference between the two trait phenotypes, the greater the probability of detecting an 
association with biallelic markers. The selection of those drastically different but relatively 
uniform phenotypes enables efficient comparisons in association studies and the possible 
detection of marked differences at the genetic level, provided that the sample sizes of the 
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populations under study are significant enough. 

In preferred embodiments, a first group of between 50 and 300 trait positive 
individuals, preferably about 100 individuals, are recruited according to their phenotypes. A 
similar number of trait negative individuals are included in such studies. 

In the present invention, typical examples of inclusion criteria include affection by 
schizophrenia. 

Association analysis 

The general strategy to perform association studies using biallelic markers derived from 
a region carrying a candidate gene is to scan two groups of individuals (case-control 
populations) in order to measure and statistically compare the allele frequencies of the biallelic 
markers of the present invention in both groups. 

If a statistically significant association with a trait is identified for at least one or more 
of the analysed biallelic markers, one can assume that: either the associated allele is directly 
responsible for causing the trait (the associated allele is the trait causing allele), or more likely 
the associated allele is in linkage disequilibrium with the trait causing allele. The specific 
characteristics of the associated allele with respect to the gene function usually gives further 
insight into the relationship between the associated allele and the trait (causal or in linkage 
disequilibrium). If the evidence indicates that the associated allele within the gene is most 
probably not the trait causing allele but is in linkage disequilibrium with the real trait causing 
allele, then the trait causing allele can be found by sequencing the vicinity of the associated 
marker. 

Another embodiment of the present invention encompasses methods of detecting an 
association between a haplotype and a phenotype, comprising the steps of: a) estimating the 
frequency of at least one haplotype in a trait positive population according to a method of 
estimating the frequency of a haplotype of the invention; b) estimating the frequency of said 
haplotype in a control population according to the method of estimating the frequency of a 
haplotype of the invention; and c) determining whether a statistically significant association 
exists between said haplotype and said phenotype. In addition, the methods of detecting an 
association between a haplotype and a phenotype of the invention encompass methods with any 
further limitation described in this disclosure, or those following, specified alone or in any 
combination: Optionally, said 13q3 l-q33-related biallelic marker may be in a sequence 
selected individually or in any combination from the group consisting of SEQ ID Nos 1 to 17, 
24, 25, 28, 74 and 79 to 132, and the complements thereof; optionally, said I3q31-q33-related 
biallelic marker may be selected individually or in any combination from the biallelic markers 
described in Tables 6b and 6c; optionally, said control population may be a trait negative 
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population, or a random population; optionally, said phenotype is a disease involving 
schizophrenia, a response to an agent acting on schizophrenia, or a side effects to an agent 
acting on schizophrenia. 

Haplotype analysis 

As described above, when a chromosome carrying a disease allele first appears in a 
population as a result of either mutation or migration, the mutant allele necessarily resides on a 
chromosome having a set of linked markers: the ancestral haplotype. This haplotype can be 
tracked through populations and its statistical association with a given trait can be analysed. 
Complementing single point (allelic) association studies with multi-point association studies 
also called haplotype studies increases the statistical power of association studies. Thus, a 
haplotype association study allows one to define the frequency and the type of the ancestral 
carrier haplotype. A haplotype analysis is important in that it increases the.statistical power of 
an analysis involving individual markers. 

In a first stage of a haplotype frequency analysis, the frequency of the possible 
haplotypes based on various combinations of the identified biallelic markers of the invention is 
determined. The haplotype frequency is then compared for distinct populations of trait positive 
and control individuals. The number of trait positive individuals, which should be, subjected to 
this analysis to obtain statistically significant results usually ranges between 30 and 300, with a 
preferred number of individuals ranging between 50 and 1 50. The same considerations apply to 
the number of unaffected individuals (or random control) used in the study. The results of this 
first analysis provide haplotype frequencies in case-control populations, for each evaluated 
haplotype frequency a p-value and an odd ratio are calculated. If a statistically significant 
association is found the relative risk for an individual carrying the given haplotype of being 
affected with the trait under study can be approximated. 
Interaction Analysis 

The biallelic markers of the present invention may also be used to identify patterns of 
biallelic markers associated with detectable traits resulting from polygenic interactions. The 
analysis of genetic interaction between alleles at unlinked loci requires individual genotyping 
using the techniques described herein. The analysis of allelic interaction among a selected set 
of biallelic markers with appropriate level of statistical significance can be considered as a 
haplotype analysis. Interaction analysis comprises stratifying the case-control populations with 
respect to a given haplotype for the first loci and performing a haplotype analysis with the 
second loci with each subpopulation. 

Statistical methods used in association studies are further described herein. 

4) Testing for linkage in the presence of association 
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The biallelic markers of the present invention may further be used in TDT 
(transmission/disequilibrium test). TDT tests for both linkage and association and is not 
affected by population stratification. TDT requires data for affected individuals and their 
parents or data from unaffected sibs instead of from parents (see Spielmann S. et al., 1993; 
SchaidD.J. et al., 1996, Spielmann S. and Ewens W.J, 1 998). Such combined tests generally 
reduce the false - positive errors produced by separate analyses. 

Statistical methods 

In general, any method known in the art to test whether a trait and a genotype show a 
statistically significant correlation may be used. 

1) Methods in linkage analysis 

Statistical methods and computer programs useful for linkage analysis are well-known 
to those skilled in the art (see Terwilliger J.D. and Ott J., 1994; Ott J., 1991). 

2) Methods to estimate haplotype frequencies in a population 

As described above, when genotypes are scored, it is often not possible to distinguish 
heterozygotes so that haplotype frequencies cannot be easily inferred. When the gametic phase 
is not known, haplotype frequencies can be estimated from the multilocus genotypic data. Any 
method known to person skilled in the art can be used to estimate haplotype frequencies (see 
Lange K., 1 997; Weir, B.S., 1 996) Preferably, maximum-likelihood haplotype frequencies are 
computed using an Expectation- Maximization (EM) algorithm (see Dempster et al., 1 977; 
Excoffier L. and Slatkin M., 1995). This procedure is an iterative process aiming at obtaining 
maximum-likelihood estimates of haplotype frequencies from multi-locus genotype data when 
the gametic phase is unknown. Haplotype estimations are usually performed by applying the 
EM algorithm using for example the EM-HAPLO program (Hawley M.E. et al., 1994) or the 
Arlequin program (Schneider et al., 1 997). The EM algorithm is a generalised iterative 
maximum likelihood approach to estimation and is briefly described below. 

In the following part of this text, phenotypes will refer to multi-locus genotypes with 
unknown phase. Genotypes will refer to known-phase multi-locus genotypes. Suppose a 
sample of N unrelated individuals typed for K markers. The data observed are the unknown- 
phase K-locus phenotypes that can categorised in F different phenotypes. Suppose that we have 
H underlying possible hapiotypes (in case of K biallelic markers, H=2 K ). 

For phenotype j, suppose that Cj genotypes are possible. We thus have the following 
equation 

c j c, 
p j = I PKgenotypet) = £ pr{h k ,A,) Equation 1 

/=1 /=i 
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where Pj is the probability of the phenotype j, h k and hi are the two haplotypes constituent the 
genotype /. Under the Hardy- Weinberg equilibrium, pr(h h h() becomes : 

Pr(h k ,hi) = pr{h k ) 2 if h k = h h pr(k k9 hi) = 2pr(k k ).pr(hi)if h k * h f . Equation 2 
The successive steps of the E-M algorithm can be described as follows: 

Starting with initial values of the of haplotypes frequencies, noted pf\pf \ , 

these initial values serve to estimate the genotype frequencies (Expectation step) and then 

estimate another set of haplotype frequencies (Maximisation step), noted p^KpjK p$ , 

these two steps are iterated until changes in the sets of haplotypes frequency are very small. 

A stop criterion can be that the maximum difference between haplotype frequencies 
between two iterations is less than 10" 7 . These values can be adjusted according to the desired 
precision of estimations. In details, at a given iteration s, the Expectation step comprises 
calculating the genotypes frequencies by the following equation: 

pr (genotype^ = pr (phenotype j)pr (genotype ^phenotype j)^ 

_ n j P r (h k , hi ) (5 ^ Equation 3 

where genotype / occurs in phenotype j\ and where h k and hi constitute genotype Each 
probability is derived according to eq.l, and eq.2 described above. 

Then the Maximisation step simply estimates another set of haplotype frequencies 
given the genotypes frequencies. This approach is also known as gene-counting method (Smith, 
1957). 

Pr 5+1)== ^S L^it Pf(genotypei)^ Equation 4 

2 j=\i=\ 

Where S lt is an indicator variable which count the number of time haplotype t in genotype /. It 
takes the values of 0, I or 2. 

To ensure that the estimation finally obtained is the maximum-likelihood estimation 
several values of departures are required. The estimations obtained are compared and if they are 
different the estimations leading to the best likelihood are kept. 
3) Methods to calculate linkage disequilibrium between markers 

A number of methods can be used to calculate linkage disequilibrium between any two 
genetic positions, in practice linkage disequilibrium is measured by applying a statistical 
association test to haplotype data taken from a population. Linkage disequilibrium between any 
pair of biallelic markers comprising at least one of the biallelic markers of the present invention 
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(M i? Mj) having alleles (a/bj) at marker M; and alleles (a/bj) at marker Mj can be calculated for 
every allele combination (a^aj ; aj,b j; bi,aj and bj,bj), according to the Piazza formula : 
Aaiaj= V04 - V (64 + 93) (04 +92), where : 

64= - - = frequency of genotypes not having allele a,- at Mj and not having allele aj at Mj 
03= - + = frequency of genotypes not having allele a 4 at Mj and having allele aj at Mj 
02= + - = frequency of genotypes having allele a; at Mi and not having allele ^ at Mj 
Linkage disequilibrium (LD) between pairs of biallelic markers (Mi, Mj) can also be calculated 
for every allele combination (ai,aj ; ai,bj ; b i5 aj and bj,bj), according to the maximum-likelihood 
estimate (MLE) for delta (the composite genotypic disequilibrium coefficient), as described by 
Weir (Weir B.S., 1996). The MLE for the composite linkage disequilibrium is: 
Daiaj= (2n, + n 2 + n 3 + iuf2}/N - 2(pr(ai).pr(aj)) 

where nj = Z phenotype (a^a,-, a/aj), n 2 - Z phenotype (a/a;, a/bj), n 3 = Z phenotype (a/bi, V a jX 
n4= Z phenotype (aj/bj, aj/bj) and N is the number of individuals in the sample. This formula 
allows linkage disequilibrium between alleles to be estimated when only genotype, and not 
haplotype, data are available. 

Another means of calculating the linkage disequilibrium between markers is as follows. 
For a couple of biallelic markers, M, (a/bj) and Mj(a/bj), fitting the Hardy- Weinberg 
equilibrium, one can estimate the four possible haplotype frequencies in a given population 
according to the approach described above. 

The estimation of gametic disequilibrium between ai and aj is simply: 

&aiaj = Pr(keptotype(<ii><* pr(a t ).pr(a j). 

Where pr(a-J is the probability of allele a, and pr(a^l is the probability of allele tf, and where 
pr{haplotype (a u a)) is estimated as in Equation 3 above. 

For a couple of biallelic marker only one measure of disequilibrium is necessary to 
describe the association between Mi and Mj. 
Then a normalised value of the above is calculated as follows: 

D' aiaj = D a i aj / max (-pr^O-pr^) , -pr(bi).pr(bj)) with D aiij <0 
D' aiaj = D fllaj / max (prCbO.prty) , pr(ai).pr(bj)) with D aUj >0 
The skilled person will readily appreciate that other LD calculation methods can be used 
without undue experimentation. 

Linkage disequilibrium among a set of biallelic markers having an adequate 
heterozygosity rate can be determined by genotyping between 50 and 1000 unrelated 
individuals, preferably between 75 and 200, more preferably around 100. 
4) Testing for association 
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Methods for determining the statistical significance of a correlation between a 
phenotype and a genotype, in this case an allele at a biallelic marker or a haplotype made up of 
such alleles, may be determined by any statistical test known in the art and with any accepted 
threshold of statistical significance being required. The application of particular methods and 
thresholds of significance are well with in the skill of the ordinary practitioner of the art. 

Testing for association is performed by determining the frequency of a biallelic marker 
allele in case and control populations and comparing these frequencies with a statistical test to 
determine if their is a statistically significant difference in frequency which would indicate a 
correlation between the trait and the biallelic marker allele under study. Similarly, a haplotype 
analysis is performed by estimating the frequencies of all possible haplotypes for a given set of 
biallelic markers in case and control populations, and comparing these frequencies with a 
statistical test to determine if their is a statistically significant correlation between the haplotype 
and the phenotype (trait) under study. Any statistical tool useful to test for a statistically 
significant association between a genotype and a phenotype may be used. Preferably the 
statistical test employed is a chi-square test with one degree of freedom. A P- value is calculated 
(the P-value is the probability that a statistic as large or larger than the observed one would 
occur by chance). 
Statistical significance 

In preferred embodiments, significance for diagnosis purposes, either as a positive basis 
for further diagnostic tests or as a preliminary starting point for early preventive therapy, the p 
value related to a biallelic marker association is preferably about 1 x 10' 2 or less, more 
preferably about 1 x 10" 4 or less, for a single biallelic marker analysis and about 1 x 10" 3 or less, 
still more preferably 1 x KT 6 or less and most preferably of about 1 x 10' 8 or less, for a 
haplotype analysis involving several markers. These values are believed to be applicable to any 
association studies involving single or multiple marker combinations. 

The skilled person can use the range of values set forth above as a starting point in 
order to carry out association studies with biallelic markers of the present invention. In doing 
so, significant associations between the biallelic markers of the present invention and diseases 
involving schizophrenia can be revealed and used for diagnosis and drug screening purposes. 
Phenotypic permutation 

In order to confirm the statistical significance of the first stage haplotype analysis 
described above, it might be suitable to perform further analyses in which genotyping data from 
case-control individuals are pooled and randomised with respect to the trait phenotype. Each 
individual genotyping data is randomly allocated to two groups, which contain the same number 
of individuals as the case-control populations used to compile the data obtained in the first 
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stage. A second stage haplotype analysis is preferably run on these artificial groups, preferably 
for the markers included in the haplotype of the first stage analysis showing the highest relative 
risk coefficient. This experiment is reiterated preferably at least between 100 and 10000 times. 
The repeated iterations allow the determination of the percentage of obtained haplotypes with a 
significant p-value level. 
Assessment of statistical association 

To address the problem of false positives similar analysis may be performed with the 
same case-control populations in random genomic regions. Results in random regions and the 
candidate region are compared as described in US Provisional Patent Application entitled 
"Methods, software and apparati for identifying genomic regions harbouring a gene associated 
with a detectable trait". 
5) Evaluation of risk factors 

The association between a risk factor (in genetic epidemiology the risk factor is the 
presence or the absence of a certain allele or haplotype at marker loci) and a disease is measured 
by the odds ratio (OR) and by the relative risk (RR). If P(R + ) is the probability of developing 
the disease for individuals with R and P(R") is the probability for individuals without the risk 
factor, then the relative risk is simply the ratio of the two probabilities, that is: 



In case-control studies, direct measures of the relative risk cannot be obtained because of the 
sampling design. However, the odds ratio allows a good approximation of the relative risk for 
low-incidence diseases and can be calculated: 



F + is the frequency of the exposure to the risk factor in cases and F is the frequency of the 
exposure to the risk factor in controls. F + and F are calculated using the allelic or haplotype 
frequencies of the study and further depend on the underlying genetic model (dominant, 
recessive, additive...). 

One can further estimate the attributable risk (AR) which describes the proportion of individuals 
in a population exhibiting a trait due to a given risk factor. This measure is important in 
quantitating the role of a specific factor in disease etiology and in terms of the public health 
impact of a risk factor. The public health relevance of this measure lies in estimating the 
proportion of cases of disease in the population that could be prevented if the exposure of 
interest were absent. AR is determined as follows: 



RR= P(R + )/P(R") 




AR = P E (RR-l)/ (P E (RR-])+]) 
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AR is the risk attributable to a biallelic marker allele or a biallelic marker haplotype. P E is the 
frequency of exposure to an allele or a haplotype within the population at large; and RR is the 
relative risk which, is approximated with the odds ratio when the trait under study has a 
relatively low incidence in the general population. 

AR is the risk attributable to a biallelic marker allele or a biallelic marker haplotype. 
Pg is the frequency of exposure to an allele or a haplotype within the population at large; and 
RR is the relative risk which, is approximated with the odds ratio when the trait under study has 
a relatively low incidence in the general population. 

Association of Biallelic Markers of the Invention with Schizophrenia 

In the context of the present invention, an association between chromosome 13q31-q33- 
related biallelic markers and schizophrenia and bipolar disorder were established. Several 
association studies using different populations and screening samples thereof, and with different 
sets of biallelic markers distributed on the chromosome 13q31-q33 region were carried out. 
Further details concerning these association studies and the results are provided herein in 
Examples 5a to 5e. 

This information is extremely valuable. The knowledge of a potential genetic 
predisposition to schizophrenia, even if this predisposition is not absolute, might contribute in a 
very significant manner to treatment efficacy of schizophrenia and to the development of new 
. therapeutic and diagnostic tools. 

Identification Of Biallelic Markers In Linkage Disequilibrium With The Biallelic 
Markers of the Invention 

Once a first biallelic marker has been identified in a genomic region of interest, the 
practitioner of ordinary skill in the art, using the teachings of the present invention, can easily 
identify additional biallelic markers in linkage disequilibrium with this first marker. As 
mentioned before, any marker in linkage disequilibrium with a first marker associated with a 
trait will be associated with the trait. Therefore, once an association has been demonstrated 
between a given biallelic marker and a trait, the discovery of additional biallelic markers 
associated with this trait is of great interest in order to increase the density of biallelic markers 
in this particular region. The causal gene or mutation will be found in the vicinity of the marker 
or set of markers showing the highest correlation with the trait. 

Identification of additional markers in linkage disequilibrium with a given marker 
involves: (a) amplifying a genomic fragment comprising a first biallelic marker from a plurality 
of individuals; (b) identifying of second biallelic markers in the genomic region harboring said 
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first biallelic marker; (c) conducting a linkage disequilibrium analysis between said first 
biallelic marker and second biallelic markers; and (d) selecting said second biallelic markers as 
being in linkage disequilibrium with said first marker. Subcombinations comprising steps (b) 
and (c) are also contemplated. 

Methods to identify biallelic markers and to conduct linkage disequilibrium analysis are 
described herein and can be carried out by the skilled person without undue experimentation. 
The present invention then also concerns biallelic markers and other polymorphisms which are 
in linkage disequilibrium with the specific biallelic markers of the invention and which are 
expected to present similar characteristics in terms of their respective association with a given 
trait. In a preferred embodiment, the invnetion concerns biallelic markers which are in linkage 
disequilibrium with the specific biallelic markers. 

Identification Of Functional Mutations 

Once a positive association is confirmed with a biallelic marker of the present 
invention, the associated candidate gene sequence can be scanned for mutations by comparing 
the sequences of a selected number of trait positive and trait negative individuals. In a preferred 
embodiment, functional regions such as exons and splice sites, promoters and other regulatory 
regions of the gene are scanned for mutations. Preferably, trait positive individuals carry the 
haplotype shown to be associated with the trait and trait negative individuals do not carry the 
haplotype or allele associated with the trait. The mutation detection procedure is essentially 
similar to that used for biallelic site identification. 

The method used to detect such mutations generally comprises the following steps: (a) 
amplification of a region of the candidate DNA sequence comprising a biallelic marker or a 
group of biallelic markers associated with the trait from DNA samples of trait positive patients 
and trait negative controls; (b) sequencing of the amplified region; (c) comparison of DNA 
sequences from trait-positive patients and trait-negative controls; and (d) determination of 
mutations specific to trait-positive patients. Subcombinations which comprise steps (b) and (c) 
are specifically contemplated. 

It is preferred that candidate polymorphisms be then verified by screening a larger 
population of cases and controls by means of any genotyping procedure such as those described 
herein, preferably using a microsequencing technique in an individual test format. 
Polymorphisms are considered as candidate mutations when present in cases and controls at 
frequencies compatible with the expected association results. 

Candidate polymorphisms and mutations of the g35030 nucleic acid sequences 
suspected of being involved in a predisposition to schizophrenia can be confirmed by screening 
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a larger population of affected and unaffected individuals using any of the genotyping 
procedures described herein. Preferably the microsequencing technique is used. Such 
polymorphisms are considered as candidate "trait-causing" mutations when they exhibit a 
statistically significant correlation with the detectable phenotype. 

Biallelic Markers Of The Invention In Methods Of Genetic Diagnostics 
The biallelic markers and other polymorphisms of the present invention can also be 
used to develop diagnostics tests capable of identifying individuals who express a detectable 
trait as the result of a specific genotype or individuals whose genotype places them at risk of 
developing a detectable trait at a subsequent time. The trait analyzed using the present 
diagnostics may be any detectable trait, including predisposition to schizophrenia, age of onset 
of detectable symptoms, a beneficial response to or side effects related to treatment against 
schizophrenia. Such a diganosis can be useful in the monitoring, prognosis and/or prophylactic 
or curative therapy for schizophrenia. 

The diagnostic techniques of the present invention may employ a variety of 
methodologies to determine whether a test subject has a genotype associated with an increased 
risk of developing a detectable trait or whether the individual suffers from a detectable trait as a 
result of a particular mutation, including methods which enable the analysis of individual 
chromosomes for haplotyping, such as family studies, single sperm DNA analysis or somatic 
hybrids. 

The diagnostic techniques concern the detection of specific alleles present within the 
human chromosome I3q31-q33 region and optionally within a g35030 nucleic acid sequence. 
More particularly, the invention concerns the detection of a nucleic acid comprising at least one 
of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25,28, 74 and 79 to 132 or a fragment 
thereof or a complementary sequence thereto including the polymorphic base. 

These methods involve obtaining a nucleic acid sample from the individual and, 
determining, whether the nucleic acid sample contains at least one allele or at least one biallelic 
marker haplotype, indicative of a risk of developing the trait or indicative that the individual 
expresses the trait as a result of possessing a particular the human chromosome 1 3q3 l-q33- or 
g35030-related polymorphism or mutation (trait-causing allele). 

Preferably, in such diagnostic methods, a nucleic acid sample is obtained from the 
individual and this sample is genotyped using methods described above in "Methods Of 
Genotyping DNA Samples For Biallelic markers." The diagnostics may be based on a single 
biallelic marker or a on group of biallelic markers. 

In each of these methods, a nucleic acid sample is obtained from the test subject and the 



W0 01/40493 PCT/IB00/01507 

100 

biallelic marker pattern of one or more of the biallelic markers of the invention is determined. 

In one embodiment, a PCR amplification is conducted on the nucleic acid sample to 
amplify regions in which polymorphisms associated with a detectable phenotype have been 
identified. The amplification products are sequenced to determine whether the individual 
possesses one or more human chromosome 13q31-q33 region- or g35030-related 
polymorphisms associated with a detectable phenotype. The primers used to generate 
amplification products may comprise the primers listed in Table 4. Alternatively, the nucleic 
acid sample is subjected to microsequencing reactions as described above to determine whether 
the individual possesses one or more human chromosome 13q31-q33 region-related 
polymorphisms associated with a detectable phenotype resulting from a mutation or a 
polymorphism in the human chromosome 13q3 1-q33 region or g35030-related biallelic marker. 
The primers used in the microsequencing reactions may include the primers listed in 6. In 
another embodiment, the nucleic acid sample is contacted with one or more allele specific 
oligonucleotide probes which, specifically hybridize to one or more human chromosome 13q31- 
q33 region or g35030-related alleles associated with a detectable phenotype. The probes used 
in the hybridization assay may include the probes listed in Table 5. In another embodiment, the 
nucleic acid sample is contacted with a second oligonucleotide capable of producing an 
amplification product when used with the allele specific oligonucleotide in an amplification 
reaction. The presence of an amplification product in the amplification reaction indicates that 
the individual possesses one or more human chromosome 13q31-q33 region or g35030-related 
alleles associated with a detectable phenotype. 

In a preferred embodiment the identity of the nucleotide present at, at least one, biallelic 
marker selected from the group consisting of A 13 to Al 8, A20 to A46, A49 to A52, A55, A57, 
A59 to A63, A72 to A73, A76 and A123 and the complements thereof, is determined and the 
detectable trait is schizophrenia. Diagnostic kits comprise any of the polynucleotides of the 
present invention. 

These diagnostic methods are extremely valuable as they can, in certain circumstances, 
be used to initiate preventive treatments or to allow an individual carrying a significant 
haplotype to foresee warning signs such as minor symptoms. 

Diagnostics, which analyze and predict response to a drug or side effects to a drug, may 
be used to determine whether an individual should be treated with a particular drug. For 
example, if the diagnostic indicates a likelihood that an individual will respond positively to 
treatment with a particular drug, the drug may be administered to the individual. Conversely, if 
the diagnostic indicates that an individual is likely to respond negatively to treatment with a 
particular drug, an alternative course of treatment may be prescribed. A negative response may 



WO 01/40493 

PCT/IB00/01507 

101 

be defined as either the absence of an efficacious response or the presence of toxic side effects. 

Clinical drug trials represent another application for the markers of the present 
invention. One or more markers indicative of response to an agent acting against schizophrenia 
or to side effects to an agent acting against schizophrenia may be identified using the methods 
described above. Thereafter, potential participants in clinical trials of such an agent may be 
screened to identify those individuals most likely to respond favorably to the drug and exclude 
those likely to experience side effects. In that way, the effectiveness of drug treatment may be 
measured in individuals who respond positively to the drug, without lowering the measurement 
as a result of the inclusion of individuals who are unlikely to respond positively in the study and 
without risking undesirable safety problems. 

PREVENTION, DIAGNOSIS AND TREATMENT OF PSYCHIATRIC DISEASE 

An aspect of the present invention relates to the preparation of a medicament for the 
treatment of psychiatric disease, in particular schizophrenia and bipolar disorder. The present 
invention embodies medicaments acting on g35030. 

In preferred embodiments, medicaments of the invention act on g35030, either directly or 
indirectly, by acting on the g35030 pathways. For example, the medicaments may modulate, and 
more preferably decrease the level of g35030 activity which occurs in a cell or particular tissue, or 
increase or descrease the activity of the g35030 protein. In certain embodiments, the invention' 
thus comprises use of a compound capable of increasing or decreasing g35030 expression or 
g35030 protein activity in the preparation or manufacture of a medicament. Preferably, said 
compound is used for the treatment of a psychiatric disease, preferably for the treatment of 
schizophrenia or bipolar disorder. Preferably, said compound acts directly by binding to g35030 
of a g35030 receptor. 

Such medicaments may also increase or decrease the activity of a compound analogous to 
g35030, a compound comprising an amino acid sequence having at least 25% homology to a 
sequence selected from the group consisting ofSEQ ID NOs. 18 to 23, a compound comprising an 
amino acid sequence having at least 50% homology to a sequence selected from the group 
consisting of SEQ ID NOs. 18 to 23, and a compound comprising an amino acid sequence having 
at least 80% homology to a sequence selected from the group consisting of SEQ ID NOs. 1 8 to 
23. 

Medicaments which increase or descrease the activity of these compounds in an 
individual may be used to ameliorate or prevent symptoms in individuals suffering from or 
predisposed to a psychiatric disease, as discussed above in the section entitled "indications". 

Alternatively, g35030 activity may be increased or decreased by the expression of the 
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genes encoding the identified g35030-modulating compounds using gene therapy. Examples of 
vectors and promoters suitable for use in gene therapy are described above. G35030 activity may 
also be increased or decreased by preparing an antibody which binds to a g35030 peptide, a 
g35030 receptor or a protein related thereto, as well as fragments of these proteins. Such 
antibodies may modulate the interaction between g35030 and a g35030 receptor or a protein 
related thereto. Antibodies and methods of obtaining them are further described herein. 

As described above, the present invention provides cellular assays for identifying 
compounds for the treatment of psychiatric disease. The assays are based on detection of g35030 
expression, measurement of g35030 protein activity, or based on the determination of other 
suitable schizophrenia, bipolar disorder or related psychiatric disease endpoints. Compounds for 
the treatment of psychiatric disease include derivative proteins or peptides which are capable of 
inhibiting the activity of a wild type g35030 protein, which may be identified by determining their 
ability to bind a wild type g35030 protein. Compounds also include antibodies, and small 
molecules and drugs which may be obtained using a variety of synthetic approaches familiar to 
those skilled in the art, including combinatorial chemistry based techniques. 

The invention further encompasses said methods for the prevention, treatment, and 
diagnosis of disease using any of the g35030 nucleic acids of proteins of the invention in 
analogous methods. 

G35030 in Methods of Diagnosis or Detecting Predisposition 

Individuals affected by or predisposed to schizophrenia and bipolar disorder may express 
abnormal levels of g35030. Individuals having increased or decreased g35030 activity in their 
plasma, body fluids, or body tissues may be at risk of devloping schizophrenia, bipolar disorder or 
a variety of potentially related psychiatric conditions. In one aspect of the present invention is a 
method for determining whether an individual is at risk of suffering from or is currently suffering 
from schizophrenia, bipolar disorder or other psychotic disorders, mood disorders, autism, 
substance dependence or alcoholism, mental retardation, or other psychiatric diseases including 
cognitive, anxiety, eating, impulse-control, and personality disorders, as defined with the 
Diagnosis and Statistical Manual of Mental Disorders fourth edition (DSM-IV) classification, 
comprising determining whether the individual has an abnormal level of g35030 activity in 
plasma, body fluids, or body tissues. The level of g35030 or analogous compounds in plasma, 
body fluids, or body tissues may be determined using a variety approaches. In particular, the level 
may be determined using EL1SA, Western Blots, or protein electrophoresis. 

BiaUelic Markers Of The Invention In Methods Of Generic Diagnostics 
The biallelic markers and other polymorphisms of the present invention can also be 
used to develop diagnostics tests capable of identifying individuals who express a detectable 
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trait as the result of a specific genotype or individuals whose genotype places them at risk of 
developing a detectable trait at a subsequent time. The trait analyzed using the present 
diagnostics may be used to diagnose any detectable trait, including predisposition to 
schizophrenia or bipolar disorder, age of onset of detectable symptoms, a beneficial response to 
or side effects related to treatment against schizophrenia or bipolar disorder. Such a diagnosis 
can be useful in the monitoring, prognosis and/or prophylactic or curative therapy for 
schizophrenia or bipolar disorder. 

The diagnostic techniques of the present invention may employ a variety of 
methodologies to determine whether a test subject has a genotype associated with an increased 
risk of developing a detectable trait or whether the individual suffers from a detectable trait as a 
result of a particular mutation, including methods which enable the analysis of individual 
chromosomes for haplotyping, such as family studies, single sperm DNA analysis or somatic 
hybrids. 

The diagnostic techniques concern the detection of specific alleles present within the 
human chromosome 13q31-q33 region, and optionally within a g35030 nucleic acid sequence. 
More particularly, the invention concerns the detection of a nucleic acid comprising at least one 
of the nucleotide sequences of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 or a fragment 
thereof or a complementary sequence thereto including the polymorphic base. 

These methods involve obtaining a nucleic acid sample from the individual and, 
determining, whether the nucleic acid sample contains at least one allele or at least one biallelic 
marker haplotype, indicative of a risk of developing the trait or indicative that the individual 
expresses the trait as a result of possessing a particular the human chromosome 13q31-q33 
region-related polymorphism or mutation (trait-causing allele). 

Preferably, in such diagnostic methods, a nucleic acid sample is obtained from the 
individual and this sample is genotyped using methods described above in "Methods Of 
Genotyping DNA Samples For Biallelic markers." The diagnostics may be based on a single 
biallelic marker or a on group of biallelic markers. 

In each of these methods, a nucleic acid sample is obtained from the test subject and the 
biallelic marker pattern of one or more of a biallelic marker of the invention is determined. 

In one embodiment, a PCR amplification is conducted on the nucleic acid sample to 
amplify regions in which polymorphisms associated with a detectable phenotype have been 
identified. The amplification products are sequenced to determine whether the individual 
possesses one or more human chromosome 13q3 l-q33 region, g35030-related polymorphisms 
associated with a detectable phenotype. The primers used to generate amplification products 
may comprise the primers listed in Table 4. Alternatively, the nucleic acid sample is subjected 



W0 01/40493 PCT/IB00/01507 

104 

to microsequencing reactions as described above to determine whether the individual possesses 
one or more human chromosome 13q3I-q33 region, g35030-related polymorphisms associated 
with a detectable phenotype resulting from a mutation or a polymorphism in the human 
chromosome 13q3I-q33 region. The primers used in the microsequencing reactions may 
include the primers listed in Table 6. In another embodiment, the nucleic acid sample is 
contacted with one or more allele specific oligonucleotide probes which, specifically hybridize 
to one or more human chromosome 13q31-q33 region, g35030-related alleles associated with a 
detectable phenotype. The probes used in the hybridization assay may include the probes listed 
in 6b. In another embodiment, the nucleic acid sample is contacted with a second 
oligonucleotide capable of producing an amplification product when used with the allele 
specific oligonucleotide in an amplification reaction. The presence of an amplification product 
in the amplification reaction indicates that the individual possesses one or more human 
chromosome 13q31-q33 region, g3503O-related alleles associated with a detectable phenotype. 
In a preferred embodiment, the detectable trait is schizophrenia or bipolar disorder. Diagnostic 
kits comprise any of the polynucleotides of the present invention. 

These diagnostic methods are extremely valuable as they can, in certain circumstances, 
be used to initiate preventive treatments or to allow an individual carrying a significant 
haplotype to foresee warning signs such as minor symptoms. 

Diagnostics, which analyze and predict response to a drug or side effects to a drug, may 
be used to determine whether an individual should be treated with a particular drug. For 
example, if the diagnostic indicates a likelihood that an individual will respond positively to 
treatment with a particular drug, the drug may be administered to the individual. Conversely, if 
the diagnostic indicates that an individual is likely to respond negatively to treatment with a 
particular drug, an alternative course of treatment may be prescribed. A negative response may 
be defined as either the absence of an efficacious response or the presence of toxic side effects. 

Clinical drug trials represent another application for the markers of the present 
invention. One or more markers indicative of response to an agent acting against schizophrenia 
or to side effects to an agent acting against schizophrenia may be identified using the methods 
described above. Thereafter, potential participants in clinical trials of such an agent may be 
screened to identify those individuals most likely to respond favorably to the drug and exclude 
those likely to experience side effects. In that way, the effectiveness of drug treatment may be 
measured in individuals who respond positively to the drug, without lowering the measurement 
as a result of the inclusion of individuals who are unlikely to respond positively in the study and 
without risking undesirable safety problems. 

Prevention And Treatment Of Disease Using Biallelic Markers 
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In large part because of the risk of suicide, the detection of susceptibility to 
schizophrenia, bipolar disorder as well as other psychiatric disease in individuals is very 
important. Consequently, the invention concerns a method for the treatment of schizophrenia o 
bipolar disorder, or a related disorder comprising the following steps: 
- selecting an individual whose DNA comprises alleles of a biallelic marker or of a group of 
biallelic markers of the human chromosome 13q3 1-q33 region, preferably g35030-related 
markers associated with schizophrenia or bipolar disorder, 

- following up said individual for the appearance (and optionally the development) of 
the symptoms related to schizophrenia or bipolar disorder; and 

- administering a treatment acting against schizophrenia or bipolar disorder or against symptoms 
thereof to said individual at an appropriate stage of the disease. 

Another embodiment of the present invention comprises a method for the treatment of 
schizophrenia or bipolar disorder comprising the following steps: 

- selecting an individual whose DNA comprises alleles of a biallelic marker or of a 
group of biallelic markers, of the human chromosome 13q31-q33 region, and more preferably 
g35O30-related markers associated with schizophrenia or bipolar disorder; 

- administering a preventive treatment of schizophrenia or bipolar disorder to said individual. 

In a further embodiment, the present invention concerns a method for the treatment of 
schizophrenia or bipolar disorder comprising the following steps: 

- selecting an individual whose DNA comprises alleles of a biallelic marker or of a 
group of biallelic markers of the human chromosome 13q3 l-q33, and more preferably g35030- 
related markers associated with schizophrenia or bipolar disorder; 

- administering a preventive treatment of schizophrenia or bipolar disorder to said 
individual; 

- following up said individual for the appearance and the development of schizophrenia 
or bipolar disorder symptoms; and optionally 

- administering a treatment acting against schizophrenia or bipolar disorder or against 
symptoms thereof to said individual at the appropriate stage of the disease. 

For use in the determination of the course of treatment of an individual suffering from 
disease, the present invention also concerns a method for the treatment of schizophrenia or 
bipolar disorder comprising the following steps: 

- selecting an individual suffering from schizophrenia or bipolar disorder whose DNA 
comprises alleles of a biallelic marker or of a group of biallelic markers of the human 
chromosome 13q31-q33 region, and preferably g35030-related markers, associated with the 
gravity of schizophrenia or bipolar disorder or of the symptoms thereof; and 
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- administering a treatment acting against schizophrenia or bipolar disorder or 
symptoms thereof to said individual. 

The invention also concerns a method for the treatment of schizophrenia or bipolar 
disorder in a selected population of individuals. The method comprises: 

- selecting an individual suffering from schizophrenia or bipolar disorder and whose 
DNA comprises alleles of a biallelic marker or of a group of biallelic markers of the human 
chromosome 13q3 l-q33 region, and more preferably g35030-related markers associated with a 
positive response to treatment with an effective amount of a medicament acting against 
schizophrenia or bipolar disorder or symptoms thereof, 

■ and/or whose DNA does not comprise alleles of a biallelic marker or of a group of 
biallelic markers of the human chromosome 13q31-q33 region, and more preferably g35030- 
related markers associated with a negative response to treatment with said medicament; and 

- administering at suitable intervals an effective amount of said medicament to said 
selected individual. 

In the context of the present invention, a "positive response" to a medicament can be 
defined as comprising a reduction of the symptoms related to the disease. In the context of the 
present invention, a "negative response" to a medicament can be defined as comprising either a 
lack of positive response to the medicament which does not lead to a symptom reduction or 
which leads to a side-effect observed following administration of the medicament. 

The invention also relates to a method of determining whether a subject is likely to 
respond positively to treatment with a medicament. The method comprises identifying a first 
population of individuals who respond positively to said medicament and a second population 
of individuals who respond negatively to said medicament. One or more biallelic markers is 
identified in the first population which is associated with a positive response to said medicament 
or one or more biallelic markers is identified in the second population which is associated with a 
negative response to said medicament. The biallelic markers may be identified using the 
techniques described herein. 

A DNA sample is then obtained from the subject to be tested. The DNA sample is 
analyzed to determine whether it comprises alleles of one or more biallelic markers associated 
with a positive response to treatment with the medicament and/or alleles of one or more biallelic 
markers associated with a negative response to treatment with the medicament. 

In some embodiments, the medicament may be administered to the subject in a clinical 
trial if the DNA sample contains alleles of one or more biallelic markers associated with a 
positive response to treatment with the medicament and/or if the DNA sample lacks alleles of 
one or more biallelic markers associated with a negative response to treatment with the 
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code comprising a contiguous span of at least 6 amino acids of a polypeptide sequence selected 
from the group consisting of SEQ ID Nos. 1 8 to 23. 

54. A computer system comprising a processor and a data storage device wherein said data 
storage device comprises a computer readable medium according to any one of claims 52 or 53. 

55. A computer system according to claim 54, further comprising a sequence comparer and a 
data storage device having reference sequences stored thereon. 

56. A computer system of claim 55 wherein said sequence comparer comprises a computer 
program which indicates polymorphisms. 

57. A computer system of claim 55 further comprising an identifier which identifies features in 
said sequence. 

58. A method for comparing a first sequence to a reference sequence, comprising the steps of: 

a) reading said first sequence and said reference sequence through use of a computer 
program which compares sequences; and 

b) determining differences between said first sequence and said reference sequence with said 
computer program, 

wherein said first sequence is selected from the group consisting of a nucleic acid code 
comprising one of the following: 

(i) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, wherein 
said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 
1: 201 123 to 247802 and 199122 to 249803; 

(ii) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 17, and the complements 
thereof, to the extent that such a length is consistent with the particular sequence ID; 

(iii) a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90, 1 00 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, of SEQ ID Nos. 24, 25, 28, 74 and 79 to 132, or the complements thereof; 

(iv) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
span comprises at least one of the following nucleotide positions of SEQ ID No 1: 201 123 to 
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201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 

(v) a contiguous span according to (i) to (iv), wherein said span includes a biallelic marker 
selected from the group consisting of A I to A 127. 

(vi) a contiguous span of at least 12, 1 5, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1 000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 
wherein said span includes a polymorphism selected from the group consisting of A13 to A 18, 
A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123; 

(vii) a nucleotide sequence complementary to any one of the contiguous spans of (i) to (vi); 
and 

(viii) a polypeptide code comprising a contiguous span of at least 6 amino acids of a 
polypeptide sequence selected from the group consisting of SEQ ID Nos 1 8 to 23. 

59. A method according to claim 58, wherein said step of determining differences between the 
first sequence and the reference sequence comprises identifying at least one polymorphism. 

60. A method for identifying a feature in a sequence, comprising the steps of: 

a) reading said sequence through the use of a computer program which identifies features in 
sequences; and 

b) identifying features in said sequence with said computer program; 

wherein said sequence is selected from the group consisting of a nucleic acid code 
comprising one of the following: 

(i) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, wherein 
said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 
1 : 201 123 to 247802 and 199122 to 249803; 

(ii) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 17, and the complements 
thereof, to the extent that such a length is consistent with the particular sequence ID; 

(iii) a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90, 100 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, of SEQ ID Nos. 24, 25, 28, 74 and 79 to 132, or the complements thereof, 

(iv) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
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span comprises at least one of the following nucleotide positions of SEQ ID No 1: 201 123 to 
201234, 201123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 241153, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 

(v) a contiguous span according to (i) to (iv), wherein said span includes a biallelic marker 
selected from the group consisting of A I to A 127; 

(vi) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 
wherein said span includes a polymorphism selected from the group consisting of A 13 to A 18, 
A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 ; 

(vii) a nucleotide sequence complementary to any one of the contiguous spans of (i) to (vi); 
and 

(viii) a polypeptide code comprising a contiguous span of at least 6 amino acids of a 
polypeptide sequence selected from the group consisting of SEQ ID Nos 1 8 to 23. 
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Figure 4 

Markers involved in selected haplotypes 
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SEQUENCE LISTING 



<110> GENSET 

<120> SCHIZOPHRENIA ASSOCIATED GENE, PROTEINS AND BIALLELIC MARKERS 
<130> 53.W02 



<150> 60/168,088 
<151> 1999-11-30' 

<160> 134 

<170> Patent. pm 



<210> 1 

<211> 319608 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 199122. .201122 
<223> 5 f regulatory region 

<220> 

<221> exon 

<222> 201123. .201234 
<223> exon S 

<220> 

<221> exon 

<222> 201123. .201560 
<223> exon S2 



<220> 

<221> exon 

<222> 214676. .214793 
<223> exon T 



<220> 

<221> exon 

<222> 215702. .215746 

<223> exon U 



<220> 

<221> exon 

<222> 216836. .216994 

<223> exon V 



<220> 

<221> exon 

<222> 216836. .217077 

<223> exon V2 



<220> 

<221> exon 
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<222> 217671. .217764 
<223> exon VI 

<220> 

<221> exon 

<222> 227655. .227736 

<223> exon V4 

<220> 

<221> exon 

<222> 238715. .238919 

<223> exon V3 

<220> 

<221> exon 

<222> 240440. .240673 

<223> exon W 

<220> 

<221> exon 

<222> 240440. .241153 

<223> exon W2 

<220> 

<221> exon 

<222> 241072. .241291 

<223> exon X 

<220> 

<221> exon 

<222> 244353. .244561 

<223> exon Y 

<220> 

<221> exon 

<222> 246273. .247802 

<223> exon Z 

<220> 

<221> misc_f eature 

<222> 247803. .249803 

<223> 3' regulatory region 

<220> 

<221> allele 
<222> 8316 

<223> 99-27943-150 : polymorphic base G or C 
<220> 

<221> allele 
<222> 21672 

<223> 99-27935-193 : polymorphic base G or C 
<220> 

<221> allele 
<222> 65485 

<223> 8-128-33 : polymorphic base C or T 
<220> 

<221> allele 
<222> 95396 
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<223> 99-31960-363 : polymorphic base A or G 
<220> 

<221> allele 
<222> 107281 

<223> 99-24656-260 : polymorphic base A or G 
<220> 

<221> allele 
<222> 160640 

<223> 99-24639-163 : polymorphic base A or C 
<220> 

<221> allele 
<222> 160876 

<223> 99-24634-108 : polymorphic base A or T 
<220> 

<221> allele 
<222> 168974 

<223> 99-7652-162 : polymorphic base A or G 
<220> 

<221> allele 
<222> 170810 

<223> 99-16100-147 : polymorphic base A or G 
<220> 

<221> allele 
<222> 173358 

<223> 99-5862-167 : polymorphic base A or G 
<220> 

<221> allele 
<222> 189957 

<223> 99-5919-215 : polymorphic base A or G 
<220> 

<221> allele 
<222> 197163 

<223> 99-24 658-410 : polymorphic base A or G 
<220> 

<221> allele 
<222> 200778 

<223> 8-303-235 : polymorphic base A or G 
<220> 

<221> allele 
<222> 202651 

<223> 8-300-221 : polymorphic base A or G 
<220> 

<221> allele 
<222> 202679 

<223> 8-300-193 : polymorphic base A or G 
<220> 

<221> allele 
<222> 203378 

<223> 8-299-128 : polymorphic base A or T 
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<220> 

<221> allele 
<222> 204138 
<223> 8-296-213 : 

<220> 

<221> allele 
<222> 204605 
<223> 8-252-190 : 

<220> 

<221> allele 
<222> 204934 
<223> 99-24644-194 



polymorphic base A or T 



polymorphic base C or T 



polymorphic base A or G 



<220> 

<221> allele 
<222> 205206 
<223> 8-295-248 



: polymorphic base A or C 



<220> 

<221> allele 
<222> 205329 
<223> 8-295-125 

<220> 

<221> allele 
<222> 206064 
<223> 8-293-130 

<220> 

<221> allele 
<222> 206545 
<223> 8-292-198 

<220> 

<221> allele 

<222> 207313 

<223> 8-251-322 

<220> 

<221> allele 
<222> 208285 
<223> 8-289-322 

<220> 

<221> allele 
<222> 208960 
<223> 8-287-249 



polymorphic base C or T 



polymorphic base A or G 



polymorphic base A or G 



polymorphic base A or G 



: polymorphic base A or G 



: polymorphic base C or T 



<220> 

<221> allele 
<222> 209123 

<223> 8-287-86 : polymorphic base A or T 
<220> 

<221> allele 
<222> 209631 

<223> 8-285-319 : polymorphic base A or G 
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<220> 

<221> allele 
<222> 210361 

<223> 8-283-278 : polymorphic base G or C 
<220> 

<221> allele 
<222> 210463 

<223> 8-283-176 : polymorphic base A or G 
<220> 

<221> allele 
<222> 210486 

<223> 8-283-153 : polymorphic base G or C 
<220> 

<221> allele 
<222> 210583 

<223> 8-283-56 : polymorphic base C or T 
<220> 

<221> allele 
<222> 210879 

<223> 8-282-345 : polymorphic base G or C 
<220> 

<221> allele 
<222> 210964 

<223> 8-282-260 : polymorphic base G or T 
<220> 

<221> allele 
<222> 210979 

<223> 8-282-245 : polymorphic base A or C 
<220> 

<221> allele 
<222> 211050 

<223> 8-282-174 : variable motif AAAGG or GAAGGAAGGAAGGAAGGAAGA 
<220> 

<221> allele 
<222> 211132 

<223> 8-282-92 : polymorphic base A or T 
<220> 

<221> allele 
<222> 211247 

<223> 8-281-367 : polymorphic base A or G 
<220> 

<221> allele 
<222> 211315 

<223> 8-281-299 : polymorphic base A or G 
<220> 

<221> allele 
<222> 211366 

<223> 8-281-248 : polymorphic base G or C 
<220> 
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<221> allele 
<222> 212520 

<223> 8-279-197 : polymorphic base A or C 
<220> 

<221> allele 
<222> 212821 

<223> 8-278-289 : polymorphic base C or T 
<220> 

<221> allele 
<222> 213833 

<223> 8-277-250 : polymorphic base C or T 
<220> 

<221> allele 
<222> 214857 

<223> 8-124-294 : polymorphic base A or G 
<220> 

<221> allele 
<222> 215347 

<223> 8-273-383 : polymorphic base A or G 
<220> 

<221> allele 
<222> 216967 

<223> 8-126-286 or 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 219540 

<223> 99-5897-143 : polymorphic base A or C 
<220> 

<221> allele 
<222> 220836 

<223> 99-24649-186 : polymorphic base A or G 
<220> 

<221> allele 
<222> 223230 

<223> 8-247-337 : polymorphic base A or G 
<220> 

<221> allele 
<222> 224213 

<223> 8-24 6-393 : polymorphic base A or T 
<220> 

<221> allele 
<222> 224792 

<223> 8-24 5-270 : polymorphic base A or G 
<220> 

<221> allele 
<222> 230395 

<223> 8-119-247 : polymorphic base A or G 
<220> 

<221> allele 
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<222> 231677 

<223> 99-15870-400 : polymorphic base A or G 
<220> 

<221> allele 
<222> 235315 

<223> 99-16087-219 : polymorphic base G or C 
<220> 

<221> allele 
<222> 238789 

<223> 8-155-258 : polymorphic base C or T 
<220> 

<221> allele 
<222> 239763 

<223> 8-136-267 or 99-16038-118 : polymorphic base C or T 
<220> 

<221> allele 
<222> 241861 

<223> 99-15880-162 : polymorphic base A or G 
<220> 

<221> allele 
<222> 243700 

<223> 8-271-227 : polymorphic base A or G 
<220> 

<221> allele 
<222> 245264 

<223> 8-267-273 : deletion of GTGGGC 
<220> 

<221> allele 
<222> 245663 

<223> 8-266-131 : polymorphic base A or C 
<220> 

<221> allele 
<222> 245966 

<223> 8-265-208 : polymorphic base A or G 
<220> 

<221> allele 
<222> 245971 

<223> 8-265-203 : polymorphic base G or C 
<220> 

<221> allele 
<222> 247864 

<223> 99-25940-182 : polymorphic base C or T 
<220> 

<221> allele 
<222> 248315 

<223> 99-16032-292 : polymorphic base G or T 
<220> 

<221> allele 
<222> 255848 



7 



WO 01/40493 



PCT/IB00/01507 



<223> 99-16105-152 : polymorphic base A or G 
<220> 

<221> allele 
<222> 260099 

<223> 99-16033-244 : polymorphic base A or G 
<220> 

<221> allele 
<222> 279789 

<223> 99-15875-165 : polymorphic base C or T 
<220> 

<221> allele 
<222> 300365 

<223> 99-16052-214 : polymorphic base A or G 
<220> 

<221> allele 
<222> 312030 

<223> 99-16047-115 : polymorphic base A or G 
<220> 

<221> allele 
<222> 316014 

<223> 99-25993-367 : polymorphic base A or G 
<220> 

<221> primer_bind 
<222> 7938. .7958 
<223> 99-27943. rp 

<220> 

<221> primer_bind 

<222> 8446. .8465 

<223> 99-27943. pu complement 

<220> 

<221> primer_bind 
<222> 21365. .21385 
<223> 99-27935. rp 

<220> 

<221> primer_bind 

<222> 21845. .21864 

<223> 99-27935. pu complement 

<220> 

<221> primer_bind 

<222> 65453. .65471 

<223> 8-128. pu 

<220> 

<221> primer_bind 

<222> 65856. .65874 

<223> 8-128. rp complement 

<220> 

<221> primer_bind 
<222> 95034 . .95053 
<223> 99-31960. pu 
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<220> 

<221> primer_bind 

<222> 95543. .95563 

<223> 99-31960. rp complement 

<220> 

<221> primer_bind 
<222> 107022. .107040 
<223> 99-24656. pu 

<220> 

<221> primer__bind 

<222> 107495. .107513 

<223> 99-24656. rp complement 

<220> 

<221> primer_bind 

<222> 160279. .160298 

<223> 99-24639. rp 

<220> 

<221> primer_bind 
<222> 160770. .160787 
<223> 99-24634. pu 

<220> 

<221> primer_bind 

<222> 160785. . 160802 

<223> 99-24 639. pu complement 

<220> 

<221> primer_bind 

<222> 161240. .161257 

<223> 99-24634. rp complement 

<220> 

<221> primer_bind 
<222> 168813. .168830 
<223> 99-7652. pu 

<220> 

<221> primer_bind 

<222> 169331. .169351 

<223> 99-7652. rp complement 

<220> 

<221> primer__bind 
<222> 170666. . 170686 
<223> 99-16100. pu 

<220> 

<221> primer_bind 

<222> 171153. .171173 

<223> 99-1 6100. rp complement 

<220> 

<221> primer_bind 
<222> 173065. .173085 
<223> 99-5862. rp 
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<220> 

<221> primer_bind 

<222> 173495. .173514 

<223> 99-5862. pu complement 

<220> 

<221> primer_bind 
<222> 189753. .189771 
<223> 99-5919. pu 

<220> 

<221> primerjbind 

<222> 190187. .190207 

<223> 99-5919. rp complement 

<220> 

<221> primerjbind 
<222> 197116. .197135 
<223> 99-24658. rp 

<220> 

<221> primer_bind 

<222> 197555. .197572 

<223> 99-24 658. pu complement 

<220> 

<221> primer_bind 
<222> 200587. .200606 
<223> 8-303. rp 

<220> 

<221> primer_bind 
<222> 200994 . .201012 
<223> 8-303. pu complement 

<220> 

<221> primer_bind 
<222> 202440. .202459 
<223> 8-300. rp 

<220> 

<221> primerjsind 
<222> 202852. .202871 
<223> 8-300. pu complement 

<220> 

<221> primer_bind 
<222> 203061. .203080 
<223> 8-299. rp 

<220> 

<221> primer_bind 

<222> 203486. .203505 

<223> 8-299. pu complement 

<220> 

<221> primer_bind 
<222> 203943. .203962 
<223> 8-296. rp 

<220> 
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<221> primerjbind 
<222> 204345. .204362 
<223> 8-296. pu complement 

<220> 

<221> primer_bind 

<222> 204373. .204392 

<223> 8-252. rp 

<220> 

<221> primer_bind 
<222> 204741. .204758 
<223> 99-24644. pu 

<220> 

<221> primer_bind 
<222> 204777. .204794 
<223> 8-252. pu complement 

<220> 

<221> prime r_bind 
<222> 205034 . .205052 
<223> 8-295. rp 

<220> 

<221> primerjbind 

<222> 205222. .205240 

<223> 99-24 64 4. rp complement 

<220> 

<221> primer_bind 
<222> 205435. .205453 
<223> 8-295. pu complement 

<220> 

<221> primer_bind 
<222> 205768. .205786 
<223> 8-293. rp 

<220> 

<221> primer_bind 
<222> 206175. .206193 
<223> 8-293. pu complement 

<220> 

<221> primerjbind 
<222> 206316. .206333 
<223> 8-292. rp 

<220> 

<221> primer__bind 
<222> 206725. .206742 
<223> 8-292. pu complement 

<220> 

<221> primer_bind 
<222> 207186. .207205 
<223> 8-251. rp 

<220> 

<221> primer_bind 
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<222> 207616. .207634 
<223> 8-251. pu complement 

<220> 

<221> primer_bind 

<222> 208186. .208205 

<223> 8-289. rp 

<220> 

<221> primer_bind 
<222> 208588. .208606 
<223> 8-289. pu complement 

<220> 

<221> primer_bincl 
<222> 208786. .208805 
<223> 8-287. rp 

<220> 

<221> primer_bind 
<222> 209189. .209208 
<223> 8-287. pu complement 

<220> 

<221> primer_bind 

<222> 209520. .209539 

<223> 8-285. rp 

<220> 

<221> primer Jbind 
<222> 209931. .209949 
<223> 8-285. pu complement 

<220> 

<221> primer_bind 
<222> 210219. .210237 
<223> 8-283. rp 

<220> 

<221> primerjoind 
<222> 210620. .210638 
<223> 8-283. pu complement 

<220> 

<221> primer_bind 
<222> 210801. .210820 
<223> 8-282. rp 

<220> 

<221> primer_bind 
<222> 211182. .211199 
<223> 8-281. rp 

<220> 

<221> primer_bind 
<222> 211205. .211223 
<223> 8-282. pu complement 

<220> 

<221> primer_bind 
<222> 211596. .211613 
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<223> 8-281. pu complement 
<220> 

<221> primer__bind 
<222> 212290. .212309 
<223> 8-279. rp 

<220> 

<221> primer_bind 
<222> 212662. .212679 
<223> 8-278. rp 

<220> 

<221> primer_bind 
<222> 212698. .212716 
<223> 8-279. pu complement 

<220> 

<221> primer_bind 
<222> 213091. .213109 
<223> 8-278. pu complement 

<220> 

<221> primer_bind 
<222> 213783. .213803 
<223> 8-277. rp 

<220> 

<221> primer_bind 

<222> 214062. .214082 

<223> 8-277. pu complement 

<220> 

<221> primer_bind 
<222> 214564 . .214581 
<223> 8-124. pu 

<220> 

<221> primer_bind 
<222> 214965. .214983 
<223> 8-124. rp complement 

<220> 

<221> primer_bind 
<222> 215310. .215328 
<223> 8-273. rp 

<220> 

<221> primer_bind 
<222> 215711. .215729 
<223> 8-273. pu complement 

<220> 

<221> primerjbind 
<222> 216683. .216702 
<223> 8-130. rp 

<220> 

<221> primer__bind 
<222> 216687. .216704 
<223> 8-126. pu 
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<220> 

<221> primer_bind 
<222> 217082. .217101 
<223> 8-126. rp complement 

<220> 

<221> primer_bind 
<222> 217091. .217109 
<223> 8-130. pu complement 

<220> 

<221> primer_bind 

<222> 219408. .219425 

<223> 99-5897. pu 

<220> 

<221> primer_bind 

<222> 219882. .219899 

<223> 99-5897. rp complement 

<220> 

<221> primer_bind 

<222> 220505. .220522 

<223> 99-24649. rp 

<220> 

<221> primer_bind 

<222> 221004 . .221021 

<223> 99-24649. pu complement 

<220> 

<221> primer_bind 
<222> 223011. .223028 
<223> 8-247. rp 

<220> 

<221> primerjbind 
<222> 223549. .223566 
<223> 8-247. pu complement 

<220> 

<221> primer_bind 
<222> 223962. .223978 
<223> 8-246. rp 

<220> 

<221> primer_bind 
<222> 224564. .224581 
<223> 8-245. rp 

<220> 

<221> primerjbind 
<222> 224588. .224605 
<223> 8-24 6. pu complement 

<220> 

<221> primer__bind 
<222> 225044. .225061 
<223> 8-245. pu complement 
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<220> 

<221> primer_bind 
<222> 230210. .230227 
<223> 8-119. rp 

<220> 

<221> primer_bind 

<222> 230622. .230641 

<223> 8-119. pu complement 

<220> 

<221> primer_bind 
<222> 231278. .231298 
<223> 99-15870. pu 

<220> 

<221> primer_bind 

<222> 231729. .231747 

<223> 99-15870. rp complement 

<220> 

<221> prime r_bind 

<222> 235081. .235101 

<223> 99-16087. rp 

<220> 

<221> primer_bind 

<222> 235515. .235533 

<223> 99-16087. pu complement 

<220> 

<221> primer_bind 
<222> 238607. .238626 
<223> 8-155. rp 

<220> 

<221> primer_bind 

<222> 239029. .239046 

<223> 8-155. pu complement 

<220> 

<221> primer_bind 
<222> 239405. .239425 
<223> 99-16038. rp 

<220> 

<221> primer_bind 
<222> 239606. .239624 
<223> 8-136. rp 

<220> 

<221> primer_bind 

<222> 239862. .239880 

<223> 99-16038. pu complement 

<220> 

<221> primer_bind 
<222> 240012. .240029 
<223> 8-136. pu complement 

<220> 
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<221> primer_bind 
<222> 241700. .241717 
<223> 99-15880. pu 

<220> 

<221> primer_bind 

<222> 242151. .242171 

<223> 99-15880. rp complement 

<220> 

<221> primerjbind 
<222> 243484 . .243502 
<223> 8-271. rp 

<220> 

<221> primer_bind 
<222> 243909. .243926 
<223> 8-271. pu complement 

<220> 

<221> primer_bind 
<222> 245105. .245123 
<223> 8-267. rp 

<220> 

<221> primer_bind 
<222> 245372. .245389 
<223> 8-266. rp 

<220> 

<221> primerjbind 

<222> 245518. .245535 

<223> 8-267. pu complement 

<220> 

<221> primer_bind 
<222> 245754 . .245772 
<223> 8-265. rp 

<220> 

<221> primerjoind 
<222> 245775. .245793 
<223> 8-266. pu complement 

<220> 

<221> primer_bind 

<222> 246155. .246173 

<223> 8-265. pu complement 

<220> 

<221> primer_bind 
<222> 247513. .247533 
<223> 99-25940. rp 

<220> 

<221> primer_bind 

<222> 248023. .248043 

<223> 99-25940. pu complement 

<220> 

<221> primerjbind 
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<222> 248204 . .248223 
<223> 99-16032. rp 

<220> 

<221> primer_bind 

<222> 248588. .248606 

<223> 99-16032. pu complement 

<220> 

<221> primer_bind 
<222> 255697. .255715 
<223> 99-16105. pu 

<220> 

<221> primer_bind 

<222> 256133. .256152 

<223> 99-16105. rp complement 

<220> 

<221> primer_bind 
<222> 259885. .259902 
<223> 99-16033. rp 

<220> 

<221> primer_bind 

<222> 260324 . .260342 

<223> 99-1 6033. pu complement 

<220> 

<221> primer_bind 
<222> 279626. .279644 
<223> 99-15875. pu 

<220> 

<221> prime r_bind 

<222> 280154 . .280173 

<223> 99-15875. rp complement 

<220> 

<221> primer_bind 
<222> 300153. .300170 
<223> 99-16052. pu 

<220> 

<221> primerjbind 

<222> 300660. .300680 

<223> 99-16052. rp complement 

<220> 

<221> primer_bind 
<222> 311615. .311632 
<223> 99-16047. rp 

<220> 

<221> primer_bind 

<222> 312126. .312144 

<223> 99-16047. pu complement 

<220> 

<221> primer_bind 
<222> 315649. .315668 
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<223> 99-25993. pu 
<220> 

<221> primer_bind 

<222> 316129. .316147 

<223> 99-25993. rp complement 

<220> 

<221> primer_bind 
<222> 8297. .8315 
<223> 99-27943-150. mis 

<220> 

<221> primer_bind 
<222> 8317. .8335 

<223> 99-27943-150. mis complement 
<220> 

<221> primer_bind 
<222> 21653. .21671 
<223> 99-27935-193. mis 

<220> 

<221> primer_bind 

<222> 21673. .21691 

<223> 99-27935-193. mis complement 

<220> 

<221> primer_bind 

<222> 65466. . 65484 

<223> 8-128-33. mis 

<220> 

<221> primerjoind 

<222> 65486. . 65504 

<223> 8-128-33. mis complement 

<220> 

<221> primer__bind 
<222> 95377. .95395 
<223> 99-31960-363. mis 

<220> 

<221> primer_bind 

<222> 95397. .95415 

<223> 99-31960-363. mis complement 

<220> 

<221> primer_bind 
<222> 107262. .107280 
<223> 99-24656-260. mis 

<220> 

<221> primer_bind 

<222> 107282. .107300 

<223> 99-24 656-260. mis complement 

<220> 

<221> primerjoind 
<222> 160621. . 160639 
<223> 99-24639-163. mis 



18 



WO 01/40493 



PCT/IBOO/01507 



<220> 

<221> primer_bind 

<222> 160641. .160659 

<223> 99-24639-163. mis complement 

<220> 

<221> primer_bind 
<222> 160857. .160875 
<223> 99-24634-108. mis 

<220> 

<221> primer_bind 

<222> 160877. . 160895 

<223> 99-24634-108. mis complement 

<220> 

<221> primerjbind 
<222> 168955. .168973 
<223> 99-7652-162. mis 

<220> 

<221> primer_bind 

<222> 168975. .168993 

<223> 99-7652-162. mis complement 

<220> 

<221> primer_bind 
<222> 170791. .170809 
<223> 99-16100-147. mis 

<220> 

<221> primer_bind 

<222> 170811. .170829 

<223> 99-16100-147. mis complement 

<220> 

<221> primer_bind 
<222> 173339. . 173357 
<223> 99-5862-167. mis 

<220> 

<221> primer_bind 

<222> 173359. .173377 

<223> 99-5862-167. mis complement 

<220> 

<221> primer_bind 

<222> 189938. . 189956 

<223> 99-5919-215. mis 

<220> 

<221> primer_bind 

<222> 189958. .189976 

<223> 99-5919-215. mis complement 

<220> 

<221> primer_bind 
<222> 197144 . . 197162 
<223> 99-24658-410. mis 
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<220> 

<221> primer_bind 

<222> 197164 . . 197182 

<223> 99-24658-410. mis complement 

<220> 

<221> primer_bind 
<222> 200759. .200777 
<223> 8-303-235. mis 

<220> 

<221> primer_bind 

<222> 200779. .200797 

<223> 8-303-235. mis complement 

<220> 

<221> primer_bind 
<222> 202632. .202650 
<223> 8-300-221. mis 

<220> 

<221> primer_bind 

<222> 202652. .202670 

<223> 8-300-221. mis complement 

<220> 

<221> primer_bind 

<222> 202660. .202678 

<223> 8-300-193. mis 

<220> 

<221> primer_bind 

<222> 202680. .202698 

<223> 8-300-193. mis complement 

<220> 

<221> primer_bind 
<222> 203359. .203377 
<223> 8-299-128. mis 

<220> 

<221> primer_bind 

<222> 203379. .203397 

<223> 8-299-128. mis complement 

<220> 

<221> primer__bind 
<222> 204119. .204137 
<223> 8-296-213. mis 

<220> 

<221> primer_bind 

<222> 204139. .204157 

<223> 8-296-213. mis complement 

<220> 

<221> primerjsind 
<222> 204586. .204604 
<223> 8-252-190. mis 

<220> 
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<221> primer_bind 

<222> 204606. .204624 

<223> 8-252-190. mis complement 

<220> 

<221> primer_bind 
<222> 204915. .204933 
<223> 99-24644-194. mis 

<220> 

<221> primer_bind 

<222> 204935. .204953 

<223> 99-24644-194. mis complement 

<220> 

<221> primer_bind 
<222> 205187. .205205 
<223> 8-295-248. mis 

<220> 

<221> primer_bind 

<222> 205207. .205225 

<223> 8-295-248. mis complement 

<220> 

<221> primer_bind 
<222> 205310. .205328 
<223> 8-295-125. mis 

<220> 

<221> primer_bind 

<222> 205330. .205348 

<223> 8-295-125. mis complement 

<220> 

<221> primer_bind 
<222> 206045. .206063 
<223> 8-293-130. mis 

<220> 

<221> primer_bind 

<222> 206065. .206083 

<223> 8-293-130 .mis complement 

<220> 

<221> primer_bind 
<222> 206526. .206544 
<223> 8-292-198. mis 

<220> 

<221> primer_bind 

<222> 206546. .206564 

<223> 8-292-198. mis complement 

<220> 

<221> primer_bind 
<222> 207294. .207312 
<223> 8-251-322. mis 

<220> 

<221> primer bind 
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<222> 207314 . .207332 

<223> 8-251-322. mis complement 

<220> 

<221> primer_bind 
<222> 208266. .208284 
<223> 8-289-322. mis 

<220> 

<221> primer_bind 

<222> 208286. .208304 

<223> 8-289-322. mis complement 

<220> 

<221> primer_bind 
<222> 208941. .208959 
<223> 8-287-249. mis 

<220> 

<221> primer_bind 

<222> 208961. .208979 

<223> 8-287-249. mis complement 

<220> 

<221> primer_bind 
<222> 209104 . .209122 
<223> 8-287-86. mis 

<220> 

<221> primer_bind 

<222> 209124 . .209142 

<223> 8-287-86. mis complement 

<220> 

<221> primer_bind 
<222> 209612. .209630 
<223> 8-285-319. mis 

<220> 

<221> primer_bind 

<222> 209632. .209650 

<223> 8-285-319. mis complement 

<220> 

<221> primer_bind 
<222> 210342. .210360 
<223> 8-283-278. mis 

<220> 

<221> primer_bind 

<222> 210362. .210380 

<223> 8-283-278. mis complement 

<220> 

<221> primer_bind 
<222> 210444 . .210462 
<223> 8-283-176. mis 

<220> 

<221> primer_bind 
<222> 210464. .210482 



22 



WO 01/40493 



PCT/IBOO/01507 



<223> 8-283-176. mis complement 
<220> 

<221> primer_bind 

<222> 210467. .210485 

<223> 8-283-153. mis 

<220> 

<221> primer_bind 

<222> 210487. .210505 

<223> 8-283-153. mis complement 

<220> 

<221> primer_bind 
<222> 210564 . .210582 
<223> 8-283-56. mis 

<220> 

<221> primer_bind 

<222> 210584 . .210602 

<223> 8-283-56. mis complement 

<220> 

<221> primer_bind 

<222> 210860. .210878 

<223> 8-282-345. mis 

<220> 

<221> primerjbind 

<222> 210880. .210898 

<223> 8-282-345. mis complement 

<220> 

<221> primer_bind 
<222> 210945. .210963 
<223> 8-282-260. mis 

<220> 

<221> primer_bind 
<222> 210960. .210978 
<223> 8-282-245. mis 

<220> 

<221> primerjbind 

<222> 210965. .210983 

<223> 8-282-260. mis complement 

<220> 

<221> primer_bind 

<222> 210980. .210998 

<223> 8-282-245. mis complement 

<220> 

<221> primer_bind 
<222> 211113. .211131 
<223> 8-282-92. mis 

<220> 

<221> primer_bind 

<222> 211133. .211151 

<223> 8-282-92. mis complement 
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<220> 

<221> primer_bind 

<222> 211228. .211246 

<223> 8-281-367. mis 

<220> 

<221> primer_bind 

<222> 211248. .211266 

<223> 8-281-367 .mis complement 

<220> 

<221> primer_bind 
<222> 211296. .211314 
<223> 8-281-299. mis 

<220> 

<221> primer_bind 

<222> 211316. .211334 

<223> 8-281-299. mis complement 

<220> 

<221> primer_bind 

<222> 211347. .211365 

<223> 8-281-248. mis 

<220> 

<221> primer_bind 

<222> 211367. .211385 

<223> 8-281-248. mis complement 

<220> 

<221> primer_bind 
<222> 212501. .212519 
<223> 8-279-197. mis 

<220> 

<221> primer_bind 

<222> 212521. .212539 

<223> 8-279-197 .mis complement 

<220> 

<221> prime r_bind 
<222> 212802. .212820 
<223> 8-278-289. mis 

<220> 

<221> primer_bind 

<222> 212822. .212840 

<223> 8-278-289. mis complement 

<220> 

<221> primer_bind 
<222> 213814 . .213832 
<223> 8-277-250. mis 

<220> 

<221> primer_bind 

<222> 213834 . .213852 

<223> 8-277-250. mis complement 
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<220> 

<221> primer_bind 
<222> 214838. .214856 
<223> 8-124-294 .mis 

<220> 

<221> primer_bind 

<222> 214858. .214876 

<223> 8-124-294 .mis complement 

<220> 

<221> primer_bind 
<222> 215328. .215346 
<223> 8-273-383. mis 

<220> 

<221> primer_bind 

<222> 215348. .215366 

<223> 8-273-383. mis complement 

<220> 

<221> primer_bind 
<222> 216948. .216966 
<223> 8-126-286. mis 

<220> 

<221> primer_bind 
<222> 216948. .216966 
<223> 8-130-143. mis 

<220> 

<221> primerjDind 

<222> 216968. .216986 

<223> 8-126-286. mis complement 

<220> 

<221> primer_bind 

<222> 216968 . .216986 

<223> 8-130-143. mis complement 

<220> 

<221> primer_bind 
<222> 219521. .219539 
<223> 99-5897-143. mis 

<220> 

<221> primer_bind 

<222> 219541. .219559 

<223> 99-5897-143. mis complement 

<220> 

<221> primer_bind 
<222> 220817. .220835 
<223> 99-24649-186. mis 

<220> 

<221> primer_bind 

<222> 220837. .220855 

<223> 99-24649-186. mis complement 

<220> 
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<221> primer__bind 
<222> 223211. .223229 
<223> 8-247-337. mis 

<220> 

<221> primer_bind 

<222> 223231. .223249 

<223> 8-247-337. mis complement 

<220> 

<221> primer_bind 
<222> 224194 . .224212 
<223> 8-246-393. mis 

<220> 

<221> primer_bind 

<222> 224214 . .224232 

<223> 8-246-393. mis complement 

<220> 

<221> primer_bind 
<222> 224773. .224791 
<223> 8-245-270. mis 

<220> 

<221> primer_bind 

<222> 224793. .224811 

<223> 8-245-270. mis complement 

<220> 

<221> primer_bind 

<222> 230376. .230394 

<223> 8-119-247. mis 

<220> 

<221> primer_bind 

<222> 230396. .230414 

<223> 8-119-247. mis complement 

<220> 

<221> primer_bind 
<222> 231658. .231676 
<223> 99-15870-400. mis 

<220> 

<221> primer_bind 

<222> 231678. .231696 

<223> 99-15870-400. mis complement 

<220> 

<221> primerjbind 
<222> 235296. .235314 
<223> 99-16087-219. mis 

<220> 

<221> primer_bind 

<222> 235316. .235334 

<223> 99-16087-219. mis complement 

<220> 

<221> primer_bind 
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<222> 238770. .238788 
<223> 8-155-258. mis 

<220> 

<221> primer_bind 

<222> 238790. .238808 

<223> 8-155-258 .mis complement 

<220> 

<221> primer_bind 
<222> 239744. .239762 
<223> 8-136-267. mis 

<220> 

<221> primer_bind 
<222> 239744 . .239762 
<223> 99-16038-118. mis 

<220> 

<221> primer_bind 

<222> 239764. .239782 

<223> 8-136-267 .mis complement 

<220> 

<221> primer_bind 

<222> 239764. .239782 

<223> 99-16038-118. mis complement 

<220> 

<221> primer_bind 
<222> 241842. .241860 
<223> 99-15880-162. mis 

<220> 

<221> primer_bind 

<222> 241862. .241880 

<223> 99-15880-162. mis complement 

<220> 

<221> primer_bind 
<222> 243681. .243699 
<223> 8-271-227. mis 

<220> 

<221> primer_bind 

<222> 243701. .243719 

<223> 8-271-227 .mis complement 

<220> 

<221> primer Jbind 

<222> 245644 . .245662 

<223> 8-266-131. mis 

<220> 

<221> primer_bind 

<222> 245664. .245682 

<223> 8-266-131 .mis complement 

<220> 

<221> primer_bind 
<222> 245947. .245965 
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<223> 8-265-208. mis 
<220> 

<221> primer_bind 
<222> 245952. .245970 
<223> 8-265-203. mis 

<220> 

<221> primer_bind 

<222> 245967. .245985 

<223> 8-265-208. mis complement 

<220> 

<221> primer_bind 

<222> 245972. .245990 

<223> 8-265-203 .mis complement 

<220> 

<221> primer_bind 
<222> 247845. .247863 
<223> 99-25940-182. mis 

<220> 

<221> primer_bind 

<222> 247865. .247883 

<223> 99-25940-182. mis complement 

<220> 

<221> primer_bind 
<222> 248296. .248314 
<223> 99-16032-292. mis 

<220> 

<221> primer_bind 

<222> 248316. .248334 

<223> 99-16032-292. mis complement 

<220> 

<221> primer_bind 
<222> 255829. .255847 
<223> 99-16105-152. mis 

<220> 

<221> primer__bind 

<222> 255849. .255867 

<223> 99-16105-152. mis complement 

<220> 

<221> primer_bind 

<222> 260080. .260098 

<223> 99-16033-244. mis 

<220> 

<221> primer_bind 

<222> 260100. .260118 

<223> 99-16033-244 .mis complement 

<220> 

<221> primer_bind 
<222> 279770. .279788 
<223> 99-15875-165. mis 
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<220> 

<221> primer_bind 

<222> 279790. .279808 

<223> 99-15875-165. mis complement 

<220> 

<221> primer_bind 
<222> 300346. .300364 
<223> 99-16052-214 .mis 

<220> 

<221> primer_bind 

<222> 300366. .300384 

<223> 99-16052-214 .mis complement 

<220> 

<221> primerjbind 
<222> 312011. .312029 
<223> 99-16047-115. mis 

<220> 

<221> primer_bind 

<222> 312031. .312049 

<223> 99-16047-115. mis complement 

<220> 

<221> primerjbind 
<222> 315995. .316013 
<223> 99-25993-367. mis 

<220> 

<221> primerjbind 

<222> 316015. .316033 

<223> 99-25993-367. mis complement 

<220> 

<221> misc_binding 

<222> 8304. .8328 

<223> 99-27943-150. probe 

<220> 

<221> misc_binding 
<222> 21660. .21684 
<223> 99-27935-193. probe 

<220> 

<221> misc_binding 
<222> 65473. .65497 
<223> 8-128-33. probe 

<220> 

<221> misc_binding 
<222> 95384.. 95408 
<223> 99-31960-363. probe 

<220> 

<221> misc_binding 
<222> 107269. . 107293 
<223> 99-24656-260. probe 
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<220> 

<221> misc_binding 
<222> 160628. .160652 
<223> 99-24639-163. probe 

<220> 

<221> misc_binding 
<222> 160864 . .160888 
<223> 99-24634-108. probe 

<220> 

<221> misc_binding 
<222> 168962. .168986 
<223> 99-7652-162. probe 

<220> 

<221> misc_binding 

<222> 170798. .170822 

<223> 99-16100-147. probe 

<220> 

<221> misc_binding 
<222> 173346. .173370 
<223> 99-5862-167. probe 

<220> 

<221> misc_binding 
<222> 189945. .189969 
<223> 99-5919-215. probe 

<220> 

<221> misc_binding 
<222> 197151 . .197175 
<223> 99-24658-410. probe 

<220> 

<221> misc_binding 
<222> 200766. .200790 
<223> 8-303-235. probe 

<220> 

<221> misc_binding 
<222> 202639. .202663 
<223> 8-300-221. probe 

<220> 

<221> misc_binding 
<222> 202667. .202691 
<223> 8-300-193. probe 

<220> 

<221> misc__binding 
<222> 203366. .203390 
<223> 8-299-128. probe 

<220> 

<221> misc_binding 
<222> 204126. .204150 
<223> 8-296-213. probe 

<220> 
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<221> misc_binding 
<222> 204593. .204617 
<223> 8-252-190. probe 

<220> 

<221> misc_binding 
<222> 204922. .204946 
<223> 99-24644-194 .probe 

<220> 

<221> misc_binding 
<222> 205194. .205218 
<223> 8-295-248. probe 

<220> 

<221> misc_binding 
<222> 205317. .205341 
<223> 8-295-125. probe 

<220> 

<221> misc_binding 
<222> 206052. .206076 
<223> 8-293-130. probe 

<220> 

<221> misc_binding 
<222> 206533. .206557 
<223> 8-292-198. probe 

<220> 

<221> misfc_binding 
<222> 207301. .207325 
<223> 8-251-322. probe 

<220> 

<221> misc_binding 
<222> 208273. .208297 
<223> 8-289-322. probe 

<220> 

<221> misc_binding 

<222> 208948. .208972 

<223> 8-287-249. probe 

<220> 

<221> misc_binding 
<222> 209111. .209135 
<223> 8-287-86. probe 

<220> 

<221> misc_binding 
<222> 209619. .209643 
<223> 8-285-319. probe 

<220> 

<221> misc_binding 
<222> 210349. .210373 
<223> 8-283-278. probe 

<220> 

<221> misc_binding 
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<222> 210451. .210475 
<223> 8-283-176. probe 

<220> 

<221> misc_binding 
<222> 210474 . .210498 
<223> 8-283-153. probe 

<220> 

<221> misc_binding 
<222> 210571. .210595 
<223> 8-283-56. probe 

<220> 

<221> misc_binding 
<222> 210867. .210891 
<223> 8-282-345. probe 

<220> 

<221> misc_binding 
<222> 210952. .210976 
<223> 8-282-260. probe 

<220> 

<221> misc_binding 
<222> 210967. .210991 
<223> 8-282-245. probe 

<220> 

<221> miscjainding 
<222> 211120. .211144 
<223> 8-282-92. probe 

<220> 

<221> misc_binding 
<222> 211235. .211259 
<223> 8-281-367. probe 

<220> 

<221> misc_binding 
<222> 211303. .211327 
<223> 8-281-299. probe 

<220> 

<221> misc_binding 
<222> 211354 . .211378 
<223> 8-281-248. probe 

<220> 

<221> misc_binding 
<222> 212508. .212532 
<223> 8-279-197. probe 

<220> 

<221> misc_binding 
<222> 212809. .212833 
<223> 8-278-289. probe 

<220> 

<221> miscjbinding 
<222> 213821. .213845 
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<223> 8-277-250. probe 
<220> 

<221> miscjoinding 
<222> 214845. .214869 
<223> 8-124-294 .probe 

<220> 

<221> misc_binding 
<222> 215335. .215359 
<223> 8-273-383. probe 

<220> 

<221> misc_binding 
<222> 216955. .216979 
<223> 8-126-286. probe 

<220> 

<221> misc_binding 

<222> 216955. .216979 

<223> 8-130-143. probe 

<220> 

<221> misc_binding 
<222> 219528. .219552 
<223> 99-5897-143. probe 

<220> 

<221> misc_binding 
<222> 220824 . .220848 
<223> 99-24649-186. probe 

<220> 

<221> misc_binding 
<222> 223218. .223242 
<223> 8-247-337. probe 

<220> 

<221> misc_binding 
<222> 224201. .224225 
<223> 8-246-393. probe 

<220> 

<221> misc_binding 

<222> 224780. .224804 

<223> 8-245-270. probe 

<220> 

<221> misc_binding 
<222> 230383. .230407 
<223> 8-119-247. probe 

<220> 

<221> miscjbinding 
<222> 231665. .231689 
<223> 99-15870-400. probe 

<220> 

<221> misc_binding 
<222> 235303. .235327 
<223> 99-16087-219. probe 
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<220> 

<221> mi sc__bi riding 
<222> 238777. .238801 
<223> 8-155-258. probe 

<220> 

<221> misc_binding 

<222> 239751. .239775 

<223> 8-136-267. probe 

<220> 

<221> misc_binding 
<222> 239751. .239775 
<223> 99-16038-118. probe 

<220> 

<221> misc_binding 
<222> 241849. .241873 
<223> 99-15880-162. probe 

<220> 

<221> misc_birtding 
<222> 243688. .243712 
<223> 8-271-227. probe 

<220> 

<221> misc_binding 

<222> 245651. .245675 

<223> 8-266-131. probe 

<220> 

<221> misc_binding 
<222> 245954 . .245978 
<223> 8-265-208. probe 

<220> 

<221> misc_binding 
<222> 245959. .245983 
<223> 8-265-203. probe 

<220> 

<221> misc_binding 
<222> 247852. .247876 
<223> 99-25940-182. probe 

<220> 

<221> misc_binding 

<222> 248303. .248327 

<223> 99-16032-292. probe 

<220> 

<221> misc_binding 
<222> 255836. .255860 
<223> 99-16105-152. probe 

<220> 

<221> miscjbinding 
<222> 260087. .260111 
<223> 99-16033-244. probe 
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<220> 

<221> misc_binding 
<222> 279777. .279801 
<223> 99-15875-165. probe 

<220> 

<221> misc_binding 
<222> 300353. .300377 
<223> 99-16052-214 .probe 

<220> 

<221> misc_binding 
<222> 312018. .312042 
<223> 99-16047-115. probe 

<220> 

<221> misc_binding 

<222> 316002. .316026 

<223> 99-25993-367. probe 

<220> 

<221> misc_f eature 

<222> 4222, 1124 5, 27205 . . 27206, 27212 . . 27213, 27240, 27347, 27378, 28258 

108818, 128283, 137062 137063, 164757 , 17214 8 . . 172150, 180820, 184 929 
194 4 60,200104, 207061. .207062,20794 9,239085, 24 9787, 24 9806, 250156 

250189,250841,251114,264938,265019,265078,274 640,277770,314308 
318680,318731 
<223> n=a, g, c or t 

<400> 1 

cgggcgaatt cgagctcggt acccggggat catctttgtc atagattgta gtaaaaataa 60 

tctgtgaaaa gtgactgatt attgacaggt agccttgggt atgcagctca agtaggcatt 120 

ataaaataaa aaacatcaca ttccatgtat tttcaaaaaa agcttctggc ttactcaaaa 180 

ggtaagtgtt ctttaaacaa cttcaatgga aaaggagttt cttccaaatg cttggagaac 240 

tccttctcaa gtccggaata aagcaaggtc tacagagaca aagtataaat aaaaccatgg 300 

tgctgagctg gtaccactat tgggctgcta agcactgcta agcattgcct tcagatgagt 360 

tacctagcct ttgtaaagct cagattccac agctataaac tgagatgaat aataatacac 420 

aaaacactaa gctgttgagt agattaaatg aaattattta tgtagagcaa atagaacagt 480 

actgtacctt ataaaattta aaaccaaaat gtaaattctg atttttaccc aagggtcaac 540 

tccgcacaat tttgcaatgg acccttcccc ggccctccag aagctgaata tcgttgtgta 600 

tcatatccaa attcagtatc accagggcat agagtaagtg ccaactgtga ggccatagct 660 

atgcgtccac tcccctgaaa cacattgtgt accttcattc ttggagttga gtctttgcat 720 

ttcatcctcc agagtgagaa tagactgcgt gttggagagg tcagggaaat tatgtctttc 780 

tatgtctagc cctttatagg tcgattattt gaatgcttga cagcatctgt ccacttatct 840 

atccctgcaa taataaaata cgggactaaa gtagctcgta tctctagtgg gagagtgcat 900 

atattttata cgtcaattct ttaaaagggg aaattctttc agagcttcag aaaatatcac 960 

ggaagagtta aaagaatgtt tgagaagaac atgtctgagt attacctact agcattacaa 1020 

ttttactgct ttaaagtaat tataattttc attgcaggat ctattaaaaa ggcttttacc 1080 

tgacggtaat tcttcttgta cttacaggtc attattacaa tgaagggaag aaaggaagac 1140 

atgaagggga aatgggacta tttaccaagt gtatctgcat agactaaagg agacaagagg 1200 

aagtactgag caccttgatt cctgcagata attatttaag aaccataaga attctttcct 1260 

agcttctcat aagcctgctc cttcaaaggg tgagtagttg acatgatttt acatcccatt 1320 

tatatttctt tattagataa ttttatttca aatctagtct tattttttct gaaccttctg 1380 

ggatatataa cattttatat tatcatttat gtcatcaaga atgtagattt atattctttt 1440 

ctattcacat gtttatgtaa ataattgaac acctctaaat gacagatact atcattttat 1500 

atattgttaa taagtatctt catggctggg cgtggtggct cacgcccgta atcccagcac 1560 

tttgggaggc cgaggtggga ggatcacctg aggtcaggag tttgagatca gcctggcgaa 1620 

cgtggtgaaa tcccatctct actaaaaaca acaaaaatta gctgggcatg gtagcatgcg 1680 

cctatagtcc cagctactag ggaggccaag gcaggagaat cgatcgctca aacctgggag 1740 

gcagaggttt cagtgagctg atatcgtgcc actgcactct ggcctgggca acagaatgag 1800 

accctgcttc aataaataaa taaattaatt aattaattaa taaagtatct taatgacaaa 1860 
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ca^c^tt^ tatataaat ^ ttta «ttta catatgggaa gagagatctg aaattttaag 1920 

InttlM t ccaaaaacct atttttttca aagttcctcc aacatctata aaagaagtaa 1980 

agtcacttat agatctatct tacttcaggg cctatgtttt cctctaccat acaacactgc 2040 

acattcacag acacagaaat cttttacatt tctaaaataa ataaaacatc caaacttcag 2100 

"taaaaaaa atggatctac agatccacca tagactaaac gaccaaacat agtcagagg? 2160 

gtttacttgt ttgtttcggt cttaagggaa tcttcaaact accctagagc c? C gtgta?c 2220 

caatgagagc agtctggaca ttgaatgaat ggtgatgtca ttaaatgagc tatggagcaa lllo 

gatgtaagaa cacgctttag agtaaataac atatttggct tctgaggtat aaaaagggaa 2340 

ggaggctttc tcaggacatt tgggtacctg gtccttcata ctcaggaggt ttttcS?" 2400 

tcatgtcatg gaataaacta tgctttaaag aaatgctgcg tcagcagaSa gtgttt'aaa 246 l 

ttJfS?;S fc ! g h tCa , tagC "tg.aatta tgtcaaatat cattttaact aagagagcca 2520 

ttgtctttcc atatatttat ctagaagaaa atttactgga caaaataaat atagaacaac 2580 

3 ^cattaatg gtcggatagg cagttactat aaaatatgaa taa?ctaaaa 2640 

atcattttag ccatttgttt ttaataaatc tcacttgtga ttgatttatt aaaaacaggg 2700 

cctccagaag aaagaaactt aatacctttc ctccatcatt atgaaggcac tagtactgac 2760 

gtaactaaaa ttagatggca gcttgaggga ggaagagatg atgtagagat ggtgtgtaat 2820 

ctgtgtaaac agtctctcaa ctgtctagag ttagctgtac gaatctgttg ??ggc?c«g 2880 
HES?™? Caagat3tgt "gfaggac aaacatcctt aatttgctta gcaagtcgta 

tgtatttgat agaccaccaa ttttatttta ttttatttta tttttgagat ggagtcttgc 3000 

tctgtcacct gggctggagt gcagtggcgc gatctcggct cactgcaaac ?ccLSaJo 30M 

little gatt f tct 9 «=ctcagcctc ccgagtagct gggattacaa gtgctcaaaa 3120 

ccacccccgg ttaatgtttt tttcttttct tttcttttct tttcttttct tttcttttct 3180 

tttlttT" "'f tttttta 9ta gtagacacag ggtttcactJ tgttggccag 32 S 

gctggtctca aactcctgac ctcgtgatct gcccacctcg gcctcccaaa gtgcigggJt 3300 

tacgggatta caagcgtgag ccactgcacc agccctgatt tttaaaaggg at?ttagaa£ 3360 

ttcagttgaa catgtgtttt tgaagtaact ccatattgaa caggggaa?c agag^aag'a 3420 

aaccactaca ggaaaatcag gaaggcagac tccaatttat gttttctgtg tataacc?ac lllo 

tttaataaag aattaaaaat cataagagta gttttatcta gtggcctaga acttg.tt" lilo 

gaaaagtaag acaaaggaaa tctgacctca tgccagtgtc tggaagcc?t acaggaac^g 36M 

aaaggaccca cgtggaacag ctggcaactg aaactcatct tgtcaatttc aag?gtagtc 3660 

cctcttccca.attcaccgga gaaaacacag ccagctttgc atacagtgtc tggtgtcacc 3720 ■ 

a ?c? aaaa t a r^^r" tgCatacaat atctggtgtc agaatgacca aacgattggc 3780 

ttctatcctt caggattgtg aacagagagc caaggcatac aaccagtatg cccactccgt 3840 

cagctgcgag acagaggaga taaagaatgc tgaacattat gggtcactgc tttgaagtat 3900 

tcagaggcca aaatagttca aaataattgt gttaatcatt aaaagtagaa gaagtggaag 3960 

aa aaa t a a^a IZlllTtt f?""*" ^atgtgtg gagatcagaa attLlaaaa 4020 

Stttcccctt ttaaactttg agaaaagacc ataagaaccg gtgttttagt 4080 

aaaaa^J " agagtaat "actcataa aaagaaaaga actcatgata ccgtgtaaac 4140 

* t taaat ^ ct g9 tgtaaataag attaagggag gacaaagatg aaggaaagca 4200 

ggacatatca aaaaacaaat anaataagaa attttcagag cttgaatata caaacaggca 260 

ggttcaatgg aaacctaatg caatgaatgg gacaattaat gcttttaaac 4320 

t™^ 9 a = caca ? aa a ataccaacag aaaaatctgg tcacgtacaa gagataagag 4380 

tcagggttcc atggatgttc ttcttagatg agaaggacac ttcactttta tggcattgt? 4440 

tat a t aa tac ^ a3CCCaa gtttcatcat Wcaagcct cagccaaaga aaaattgaga 5M 

tgtattttac aacatacccg ttcaggccct ttaaaaatgt caaagttggc caggcaccat 4560 

ggcttccacc tgtaatccca gcactttggg aggctgagat gggtggatca tttgaggtca 4620 

ggagttcgag accagcctgg acaatatggg gaaaccccaa ctgtactaaa aaaaga?aac 4 680 

aaaattagcc aggtgtggtg gtgtgggcct gtaatcccag ctacttgaga ggctgaggca 4740 

«S££Sc a, 93300 " 9 gagCCagga 9 gtggaggttg cagtgagcca agatca?gcc 4B00 

actgtattcc aacctgggcg acagagcaaa actccatctc cccccaaaaa aagcaaagtc 4860 

at g a aa t a a Ca ! a9taaga f "TOaactgt cagattagag gggactgagg aaLatgaaa 920 

attagatgaa atagggtata cagtttagaa ttatagaaca gaaagcagga catcagttga 4980 

™ 9 aaatacaaat aaagtctatt ttagttaata ttgttgtaL aat.cLaS lllo 

allllTJt aacaaataaa atatcattat gtaagatggc aacttcagga aaagttgggg 5100 

? aaagata ! a "^atact ttgtattatc ttagaaacta ttatgtcaat attttaata? 5160 

aaaattatta gaaattgtgt cagaaacact ggatgttaga agacaatgga ggaatgcctc 5220 

aaaaaatcta cagagaactt cttttcaatc tatcatcata gcatcaatca agtatgagct 5280 

tcaaattatg atattttata atatgagaat aattctgaaa atttacctta taccaatK "SS 

tlttltTal a % taCttta ^ 9«9tgctat agtcaaacag aggaataaac taagaaaaag 5400 

! a ' g " g ' ga ^atacagaaa aatatataga aacaatttag aaaaataata aaaagtagtc 54 60 

ctatgataat agcaatataa gaatgtttag attggaatag aagatgaatg ccctcaagag 5520 
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tataacctct gaaagaaaaa ttaaataaat aaaaaaattc tgacaggata cctgatcaga 5580 

tagagatgtg acaaaccagt atgttgaaag gtgatgacag aaagagagaa cggaaagaga 5640 

gaggcaggaa ggaaagggga aaaaaagaaa gaaatgaaag gcgattctca caccagttag 5700 

aatggcaatt attaaaaagt caggaaacaa caggtgctag acaggatgtg gagaaatagg 5760 

aacactttta cactattggt gggactgtaa actagttcaa ccattgtgga agtcagtgtg 5820 

gcgattcctc agggatctag aactagaaat accatttgac ccagtcatcc cattactggc 5880 

tatataccca aaggattata aatcatgctt ctataaagac acatgcacac gtatgtttat 5940 

tgcggcacta ttcacaatag caaagacttg gaaccaaccc aaatgtccaa caatgataga 6000 

ctgggttaag aaaatgtggc acatatacac catggaatac tatgcagcca taaaaaatga 6060 

tgagttcatg tcctttgtag ggacatggat gaagctggaa accatcattc tcagcaaact 6120 

atagcaagga cagaaaacca aacaccacat gttctcactc ataggtggga attgaataat 6180 

gggaacacct ggacacagga aggggaacat cacacaccag ggcctgttgt ggggtggggg 6240 

gaggggggag ggatagcatt aggagatata cctaatgcta aatgacgagt taatgggtgc 6300 

agcacaccag catggcacat gtatacatat gtaactaacc tgtacgttgt gcccatgtac 6360 

cctagaactt aaagtataat atatatatta tatatatata aagaaattcc aagaaaatca 6420 

aaaggctgca tacaaatgaa atactatatg accattattt ccttcttggc tttttcaaga 6480 

acaacattta cttagctacc acactgaaat tactattcat taattttaag taaactggca 6540 

ttgaaaatat ggttaatctg gtttttcttt ttttttttga gacagggtct cgctctgtta 6600 

cccaggctgg agtgcagtgc tgtgatctta gctcactgca acctctgcct cccaggttcc 6660 

aacaatcctc ccacctcagc ttcccccagt agctgggatt acagctgtgt gccaccatga 6720 

acagctaatt ttatattttt agtagagatg gggttcgccc tgttggccag gctggcctcg 6780 

aactcctgac ttcaagtgat ctgcctgcct cagcctcccc aagtgttggg attacaggtg 6840 

tgagccaccg tgcccagccc agaaatgtgt ttagtagttt gaaggggcaa gtaccaggtg 6900 

ctaagagttc acaggggagg gatactggac ttagaccttc ggaatcaaga aaagctttct 6960 

atagaaagta ctgcctaagc tgaagactga acactgacca ggctgaatga aagatgtagt 7020 

gctggagagg atttcagtca gaaaaaacat ttgagaaggt ccagaagaaa aataaaacaa 7080 

tttatttcaa ggtactgaaa taaattgaga ttaaggggag tgaagagtca aggtctggag 7140 

aggtattgag ttgtcaagaa atacagtttg agaggtaaag gccatattag gaagtaatat 7200 

atttttaaac ttttgaagct tattctaagg ctaataacaa agtggtgtag gcaggaaagt 7260 

tgcagttatt ataaaacttt cagaaagatt tatttaaatt aattggaatt gagaaagaaa 7320 

ggcattattt tacgtaataa aaatatttat ttgttgtgta aaaaataaat agtttggcct 7380 

agaggagaac agggccagta aggaagagga gaaatgcata gatataaagg gtgttaggat 7440 

aaagcataag taagacttaa tgataattta aacttaagga acaaggaaaa tgaaggagta 7500 

atgaatactt gagtcacttt gtagatagag gcaccaatca ctcaactcag acagagtaga 7560 

ctagaggtgg ttttggggaa gctaaagact ccacttttag atctctggaa tatccaagat 7620 

gggaattcaa ttaggctatg atttcaagtt caaacaagag aaacgggatg gagatatttt 7680 

tagataccat ctgataatga tragtgattg aagccatggg aatggatatt ttgtcaatgt 7740 

tcttttgaga gctgtatcct tccgtagtta tctgatgggt tttttaaatg gcgtaaaata 7800 

atcagtaatt tagaagtttc tataaagaaa gttcggctac aaatacttat cctttctctt 7860 

tcattctcaa accatgcatt aaccacctat taatgtgttt agaaaaacag taagtaattc 7920 

agggttcatg ctgacagaga agacccacat aagcaaacct cgctataata caaggagcta 7980 

aattccccaa tataagaatg cagtgtgctc tgtgggaata tggagaaaca aagataagga 8040 

aaacctaggc agaaatagtt caattgattg ttatttcata ctgagacaaa tttgccaagt 8100 

ataaactata tcctattaca aatttagatg ctttgggcca tgggtcagct ccacggcgac 8160 

acctacactg aagttgtatt taagctatct gtctggatca cttggttcac aaaaatgttt 8220 

ggattacaaa aaggaaatat catacacctt ggaagaactg gctcattagt caagatgctg 8280 

cttaccagtt gcctgggttg gctgctgtgg taaagsaata tcggttggaa agccggggct 8340 

ggaagtcggg gagaggctgt cttctatatc ttacatttgc tatgtctttg gaggaaagat 8400 

gctttcggca gtgttcctta aatttaaata tcttttcctt ttggactcaa gggtatgttt 8460 

attcgacttc atcatatcta atctcctaga atattattaa cctgaagtta ttgtcagtat 8520 

cgtatgatat ttattgaata acaatatgct ctgatactgc atatgaaatg aaataacaaa 8580 

tgaacctccc ttactttaac ttaactacta attaattaga aaagaacctt cactcaaggg 8640 

gtttggaaag gtccaataag ggaattaaca gatgagaatt taagagtacc tgaacccatc 8700 

tcgatttgag atttacgtga tgctgggtga ttagtgataa gaacagaatg aagtcagctc 8760 

aggaggcatc gtgatctgga aaggagggag atatttcagt gaataattga gaggattcag 8820 

acaaagtcat atgagagaag cagagacaga cagagggcag gagtcagtgg atatgagaat 8880 

gatactagga agatgaagaa tggactgatt tgtaaagaaa ttgagaccca agttatactg 8940 

tgttaagaga taagttaaaa ccattaaaaa atattgggag atcctgaatt gctttcaagc 9000 

ctagtaattt gatgtaagaa atagtgagaa atgctcttga tgttgtgtct gtttcaggga 9060 

aagttttgtg gagacaatga aataaataaa atagcatatt tagatcaaga ggctggtttt 9120 

ctrtttcaca ggaaacatct aagaaactgt taaaagggac aaggtttgag tgttagaacg 9180 
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tgaaataggg ctatggtaat aaattgttac cattccgaag 

ccaaaaaaag agtttttaaa caaacatatg aagatataca 

tgtttatttc agattcaacc attttaacca caaagaacaa 

atgtaagcaa agatttggat ttgaggcagt 

aaaggctggt gggctttgtg agacaaagtg 

agatgtggaa tcctgtaaca atggcaatta 

aaatgggtca acgtggatca tttttagaag 

tgaaaaaaag aggatgagaa attatgtttc 

catacctcag atagaaatga attattttaa 



ggcttgattt 
aagtttggga 
aaaataatgt 
gaataaagag 
aatgagatta 
gtgatgatgg 
aagataatga 



attatcaaat 
atataataaa 
aacaacctca 
gaatatagga 
accatgccat 
gtgatgaagt 
gaaaagcact 



tatctcccag 
taaactagta 
actgagagca 
ggcattggaa 
gagaattgtg 
ggtgacatgg 
gaaactttgg 



atccaatttt gaacacattg aattcaaagt 



caaggtttat tgcaaaagct 
aggaaaccag tttttagaga 
gacaatccgt catccatgca 



gactaccatg tagacagata accaaataga atagaaacac ccaccatata taataaaacc 



acaggtgcac tcattgagtg aaccagtaca 
tgggaatagg aagactcaat gaaggcaata 
gggaattcaa gaagatcttc tttggcttcc 
tcacagttca gccatggccc agaatggtga 
cagagaccat ttgccctcta ggagcaactg 
tctagggcag ttgaaggaga gagtaaattg 
gcagcaggca agtttgcact 
attttaggcc agaagtcttt 
aaatgagata gttggcattt 
aaatttaaac agcaatattt 
gcaacagcat attaaagctt 
ttcccttgcc ctggtgggag 



gataaactat attgtagggg ctctcacatc 
aggggaagtt aaaaagaata agataaactg 
ataagatcct agactgtttc taggaagcag 
tgaccgaggg cagggccatg gtttggtgct 
atggagagaa taaattgggg agagaataaa 
taaaccaaga atgccatagc taaaagggaa 
aattctagga agggatttct ccacatttcc 
gacatgctag agagaaagat ggattttttt tttaatgaat 
ttggcatttt tggcaaagaa agatttcatt ttcaagacca 
agcacaaaaa atgcttgtca aattcctgct gaatttgaaa 



tttgaggaga 



tttcttcttt cctgcaatag 
tatgtcctcg cttatcctgg 



tcctggtggg atatactccc tcttatccta gtgggacatg 
gcctggaacc atgaatggta ccaagcccta cacacactat 
catgtatgat aaagtttaat ttataaatta ggcatagtta 



ataataaaat agaataatta 
ctttctttca aaatatctta 
ccatgcctat gtgatgtgat 
ggtggcctac tattgaccta 
ggatgattca tgtcaaggga 
gagcagtgca caatttaaaa 



taacaatata ctgtaataaa 
ttgtactgta ctcacccttt 



gtgatgaagt 
ataagcaggt 
agaattgaat 
cttatgaatt 

ttgaaaaggc agtgtgccac aggtaattga 
aggactactg tattcttaga gcaattgtat 
aaaaacaaat tagatcaaca tgggntggta ccattaggaa 
aaattcacaa ctttcaaaga aaaatattgt gagtctctgc 
atggtgtaga tatatattcg tctgttttca 
gcaatttaca aaagaaagag gtttaagaga 
acaatcatga tggaaggcaa gaatgagcaa 
agagagagaa cttgtgcagg gaaactcctg 
tagtcactac cataagaaaa gcacaggaaa 
caggttcctt ccacaacgtg tgggaattgt 
ggggacacag ccaaaccata tcaagatatc 
aaaattactc ggtttattac tgacattctg 
tacacctaat 
attgccaaaa 



ctgatgttca cgatacagta 
tgggaatatg ttccccttta 
ttccaagact cccagtggat 
gttttcatca tatacataca 
gagataaaca ataatagcta 
agttgtctga tgtggtctct 
tttgatgatg tgagaagata 
agaaaggcat catgacgcag 
gcatggagac tctggaaaag 
gggacagcac agattttatc acattactca 
gtttatttat agaattttcc atttaatatt 
agtgaaactg cagatgaggg 
tattattgat agtttaatga 
agatggtaga ttttcaattc 
tatgagatag atattgagaa 
aaagacacac ccgagactgg 
cacgtggctg gggaggcctc 
acacggttgg cagcgggcaa 
ccatcagatc tgatgagaca 
gatccacccc atgattcagt tacctcttac 
gggagttaca gttcaagatg agatttgggt 
agcagaaatg agcaaaaaat gagtgatttt 



aaggtgaatg 
tgtgtctgca 



aaccacagaa 
atcattggaa 



tggtgctaac 
cgcacagttc 
atcacgtctt 
tttgttaaaa 



cccctctctc 
gggactcagc 
gtctcagagg 
ttgcctgtcc 



atgtgcattc cttaagagat aattacattt 
gccaaagttc ttacagttca aactctagga gcaagagctg cccaagaacc 
agttacaggg agagcctcag tgtttgtttg gactctccag ctcctacctt 
tttccttgct cttctctctc catcattctt cccagccttc ccaggttctg 
ttccatgatg ctgcctcaga ctttctatcc atacggtatt attgcactgt 
agaagaggga ggccatctgg agaaggtaca gggcacctca gcagtgtctc 
attgctcctg cttcctgatc cctgccaggg gcacacctgg agaaacttag 
aaaacccttc agaaaagagt aacagagaaa actcaccaaa tacaagccat tcaatgtgcc 
atcataggtt caacccaaat cttgtttcag gttacagata gagaattcca gtgcttctta 
tcagctttac gaattttatt catgcttcag aaagttgcta aacagtgtaa atgaatattt 
gggagaagaa taaaagaact actccgaaaa tctgaaaaga gttaaaataa attccagtgc 
actttgattc ttcctttata aatacaaagg atttatggtc ttttgcatca gccagggtgg 
ccactgggaa gtcagagttt atctatatcc atcttcacat ttattacaac cacacacagc 
atttatcata cttgtattca gacatgatgc tattattatt ttccactggc atggctttta 
aaccccaaag cattagagac agatagatag atagatagac agagatattt atacaataaa 
agaattatat atataaataa aatttcttta tataatgaag aaattcatat aaaggaatat 
attatatact atatatggtt gtataaaatt atatattata cattatatat tatatgtaat 
ttacatgttt ataatttata attatattat atataattat ataatttata aaattatata 
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taaaaattgt gtatattata aaatatgtaa tatataattt ataatatatt ataatttata 12900 
tataatacat aatatatatt ataatttata tataatacat aatatataat atattataat 12960 
ttatatataa tatataatgt attatatata tgatgtattg tatatataca tacatatttc 13020 
caccaatgtg attttaaaag actaccataa actcactgaa aagaaaacat aacagttggt 13080 
ttccaagaca tgaggacatc agttggaata acgtttcttc catagaatta aatgcatctc 13140 
tggtgtcttg gaaaggcaat ttcctctcaa atgggtaaag tgtttgtact aatactgtat 13200 
aaaaatgaat acaataaaat aggcgcaaag aatctcacaa cttaaaacat cccatcttta 13260 
ttctatagct gtaagctcag agagacaatg atcccatgat tttatgggag tagaaaaaca 13320 
ctttattgag tgttttctat gtatcagaca ctgtgtacat tttgctatgt tgtcccgatt 13380 
ttattaaatg cagtcctttt cctgttcagt gtcaccattc ggaggttttt ttcaagacac 13440 
tctggtgctc catgagatct aatctccatt aatgtgacat atatgctgat attatgtaag 13500 
tgtgtgtgca tacgtacact gacacacatc gtttgcattt gtaatttttt tttacccagt 13560 
tgcctcataa aagattaaaa tggaagatta gactccattc agagtttgac aatgcctcaa 13620 
taataaatca gttctcatca tttgaagaac tatgaggtgg aacaaaaact gatttgagat 13680 
tcataccgtt gtatacctac tctttaaaaa aaagatgcaa tttacacatc tgcaaatctt 13740 
aggaaatggc ttagagaatt tgcatgattt gggcacatga agccatttta gcacataaat 13800 
tccttccttt catctctaac gggtgtagaa aggatacaaa catatttttt gcagatgatg 13860 
tgaattattg actggcataa atagcaactc acactaaatg cttcagaaat gtgtttttta 13920 
attgagtgaa gatggaataa aaaaatgcag gaaaagtcct gttctatgag agctgaagat 13980 
atggaattag atacagtatt tgtgagtgtg tgtgtatgtg tgtgccatag ctaatattat 14040 
aatttttggt tggtcttcct tgtccattta atgtttgcag catttgttga tggattttta 14100 
ctaaatgatg acaaaataat gaagcagatt agtaagaaaa cctacaaatt gtttactaga 14160 
tcttcactat ctatattaaa tagatgaact aaaaggtaaa tatggatttt tacctactaa 14220 
agaaaactac tagaatttat tgcactttaa aaatatcatg tagtgtgttg taaaatgaac 14280 
ccaatcagat gttctcttaa aggactatat tagggtattg taaatagttt tcctaaaagt 14340 
ggattttttt tttttgcacc acaattcaaa taatgccaga attgccagct ataaaccaaa 14400 
tgtgaagtcc atgagcttga aaagcccata gtgttcacgg ggatcagtcc ctttttttac 14460 
actggaaaca gccccagaaa gttccagcca gtaatgaaag atggtgcaca atggctgtga 14520 
cagaaccgtg atcaaattcg gtcattgtga gtcttctggc cagcgtcgat tttctaccac 14580 
atgtgttttg cggcctccat aagatgttaa aggaacactg aatcattggc aatccataat 14640 
actcagattc ttggcttaga gagattattt accccggggt gcctttctaa cataaacact 14700 
agccttgggt atagtaacag ggagtacata gaatagtcca actcatatga atgcaagaaa 14760 
ttataatgtg attaattttt ttggtcattt caatttattg aaggtaattc agttggctct 14820 
gtgaatgtat ataggaaatt tagaaatatt aatatgtttt tatttttttc ctctaggtta 14880 
actgaagtga atcaaactat tggattataa ttccatgaag gtaagaaaga gctccagatg 14940 
gttttcatcc tatcaccagg acctgcacag tatgagtacc tggtgcataa caaactttat 15000 
taattatctt ttctctcttt ctttccccaa ccatgtgtat acatacacac gcacacacac 15060 
atatatatgt acatacacat atatacatgt gtatggatgc ataatcacat acacacgtgt 15120 
atatatacac atatatacat gtgtatgtac atactcacgt acacatatgt gtatatattc 15180 
acatatatac atgtgtatgg gcatatacat atatataata catgtgtatg ggcacattcc 15240 
tctatataca cacatacaca cacgtacata caataaataa atggttgaat ggcaaaatga 15300 
ctaaatggat ttagtacaac ctaatgatat gccacagatt tgtcaaagag ccctggaatt 15360 
gagtaaaaat gaatgaacta aattccaaga tcttttacaa cctatgattt cattccagca 15420 
gaaatttgta aatatgtgta ccatagattt atacaaaacg ttaatggtgg cattatttgg 15480 
aatagctcca aatgggaaaa agcccaaatg tcaatcaaca gtagaatgga tggattttgg 15540 
tatatccata gaatggaatt ctagaccaca gttaaaatga gtagattatt gctacaccca 15600 
gcaacatgga tgaaactcaa aactccttga gtggataaag ccaggctgtt tacgattaca 15660 
ttctctgtaa gaacttttat tcaaatttca aatacaagca aaactattct acagtgagac 15720 
aagtcagtgg ttacctttgg cagttagtgt acccaggctg gggcaggagg gaagtttcta 15780 
ggtgatgaag atgttttaat tcctgatcta atcctagctt tactgatggg tttactttga 15840 
aaaattcatc aaggtgacta tgcactttct gtaggtttac atgaaaattt caatgacaca 15900 
cttgattgta tgactttgga tattactttt cttttctgtg ccttaatttc ctcaattgaa 15960 
aagtgaggaa gagaatgctt acctctaagg attgacatca agattagtgg ctgagcacgg 16020 
tggttcacac ctgtaatccc agcattttgg gaggcagagg caggcagatc acaagatcag 16080 
gagtttgaaa ccagcctggg caagatggtg aaaccccgtc tctactaaaa aaatacaaaa 16140 
aaaaaaaaaa ttagccaggc atggtggcat gcgcctgtat tcccagctac ttgggaggct 16200 
gagtcatgag aattgcttga acccgggagg cagaggttgc agtgagctga gatcacgcca 16260 
ctacactcca gcctgagcaa cagagcaaga ctccatctaa aaaaaacaaa acaaaacaaa 16320 
aacaaacaaa caaacgaaaa gattagatta gcaagtagag tatgtgtaaa accacctgga 16380 
aaaatgcctg gagtgtgtgt tcagtattta attgtagaaa cagttacctt cataaacaac 16440 
aggacattgt gggaacatct gttaattacc gaacagtctt tttaggaccc cctggggttt 16500 
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gtggatagta atttttatta ataacacctt tatatgggct gaaatcacaa tgatactgtg 16560 
tcaaattaac cagcagttaa gagctatgct tcttcttttt gccaatttac ttgttttatt 16620 
tgtttttaat tgacaaataa aattgtgtgt atttatcttg cacaacaagg tgttttaaaa 16680 
tatgtataca ttgtggaatg gctaaattga gttaattaac atatgcatta cctcacacag 16740 
ttattttagg aactacgttt ccagaagtct gtaaggtaga tggattgata tttttaacct 16800 
gattcattca ttcttcattt aacccaggaa ttttttgagg gtcagtttca gagcatgcgc 16860 
tacactaagt gtcgggatta ctaacatcaa ggaacagtta ctgacacaac aaatcgatgt 16920 
gaagtcctaa aagagagaac atagacagac acataaaaac atagtataac acaatatatt 16980 
aaaatataca caacaaaaca tataagataa acacagaaat gtttggagca atgaattaga 17040 
tacttaactg gtttagaaat gttttgaaga aatgtatgct aatatgtctt ttgttcaaaa 17100 
tttattaatt atataattaa cttttaatta tgataaaata cataatataa aatttattat 17160 
cttaagtact tttaagtatt cagttcagtc atgttcacta tatttatatt gatgtggaac 17220 
agatctctaa aacattttta tcttgcaaaa ctgagactct acaccctttg aacaaccttc 17280 
cattaatatt gtttacatgt taacacattt cattatatta taatgtatta tcgataacat 17340 
attatatata aatatattta taaatataca tgaaatttgt atatttgaag tattagtttc 17400 
aaacaatgct acatatatct aaggcttcta gtcctatatt ggacttacag tggaaatctg 17460 
ggaaattggc cagcatttca gtgtgaagga ggaaggatgc cgtgttctct cccatctgtt 17520 
tcagaaaacc agtgtggaga agatgcataa taggcttgac gcatttttta taatgatgtt 17580 
tatattttca ttttcttttt catagtcaca atgaaagtga aagcaaagga aagagctaat 17640 
gtcccacagg ccactgggta atagcgaact gtgaagggtg tggagtggac tcactgggag 17700 
acgggagccc atggacccct ccccagctcc tttgctccca cagcaatctt ctcccttctc 17760 
tttctttgtc tcacagtcta gaagagtgac atttcaaaaa ttatagcagc agaggtgttt 17820 
ttacctatag caaaagatgt acatgtaatc acattatgac atactacaaa acaggtaaaa 17880 
acagtagttt ttagtacttt tataaattca aacttagcaa ttttccttat ttcaacatta 17940 
atatcaatag gcattaatgc cagtttttta atgccagtga tttttaacct gctgaaattt 18000 
gttttataac ttactgcggg gtaaatgaga tatttagtta tacttttgaa agtattgctt 18060 
gtgtataaaa caccttgaaa taaaagtttg aagatgtcct atgagttgct gtgaaaagag 18120 
tagttgtgaa agtttcaaaa tgacagatgc attttaatgg cagagtagga gttcatttat 18180 
ccaaaaaaga gggaaataca gctcaaagag taaagcagat gcagagaaca gcaaacaacg 18240 
gtcagagcat tgtgaatctt cggctttgat cttgagtacc tagctagata ttactgtcaa 18300 
cggctaagat taaaatgtaa caatatcagg ggaaatgcat ttgagaacag taaaacattt 18360 
attttgtaca tatatagctt gaaatgcata tgggaaatgt aggtggagat ttttagtagg 18420 
caatagcata tgtgagtttg aaatgtaatg cggatatata atctaaaact ggttctcaat 18480 
tctgactgca cattaggaaa tttactgctt aaaatgacta aatctcaggc ccaactgcag 18540 
attgattaaa ttaaaattta tggggatgtg cgcctgaatt ttttttttca ttaaaaaaaa 18600 
aaaaaaaaga ctccccaggc gattataaga aactgccagc tttggaaact gctggcctac 18660 
aattagtgtc aacgtcatca ttgtgtgggc tgttgttgga tggatgagaa caaggatgga 18720 
tgagaacatc tagagagtgc aggttgttcc tcttcttgtc ttctatttgc ccttcagatc 18780 
tgccctttca tgtagtctgg gcaccagagc accctgagcc cagtggactg caccagtgga 18840 
ctcttccatc ctcagggttt tggtaagcac catctagagg aggcatagag agaaaatcag 18900 
ggacccagga aggtgaaaac aggttttgtt ttcccagctt cctttctgct gggtcaaggg 18960 
gtaactgcag gttgagggtc atcctcagtc aaaggccaca gctgcagcag gtgcacctct 19020 
tcacacagct gcttctagag tctgcttagc tgcttcaggt caagggatga tgaggtctcc 19080 
caactgttgc tagctctata gttctcagcc atcttcagtt gattccccta aatcctgcct 19140 
gccccttcgt agacagtacc ttcattaaat attcctcatt ggttaaaaag tagtgacctt 19200 
gtgctgaata ttctatgcca agcctctgcc tgctgcagta acataagaag agttgtggct 19260 
gaagatggaa ccaagagaga aaatgacctt taaaagatgt aacgaggtgc ccagggaaag 19320 
ttttggcaat ggaatctgag catgtagagt ttaaatgtaa gaacggaaaa atatctgttg 19380 
tatttgaaag gtcacgaaag gcttcttccc gagaggaatt ttagtggtgc gctaacaaaa 19440 
atcagatggc aggggctggg gcatcaatgc aagtgaagaa atgaagggtg gttatttgac 19500 
aagtttgatg ataggaggaa taaagaattt ggatttatta aaaataagac atggagctaa 19560 
tcaattgaga ttgccaaact cttttaatga aaagtagctg tgctctatct ccagcagttg 19620 
caacagagca ctaataatat caccgagact attgatgtga gggcaatata catactcagt 19680 
gctgtaaagg cttagatttt tgatagctac tcaaggaaga acacaaagaa ttaagtcttg 19740 
atttttaaaa tgaacaaaga tataaagaga gaaaatacag aatgtatacg aataatcaaa 19800 
acatatgaaa cgacagccat ctctggtaat caaaagaagg ctactacaag caatgatata 19860 
tgttttgcct tataacttca taaagattca gaaaatgggc catagccact attggcaaga 19920 
aggcaaataa agtccatttt tggtcagaat gtcaatttat gtatcttttc tgaagtctgg 19980 
tctttgcact cttacccttt taaagaaaac attaattgtt tcactgaagc tctgagttca 20040 
tatggccatg ctctcctgcc caggtgttct ctcccttaag ccttctcttg agcacttatg 20100 
tgccactcat catcatgact tcagtcctta aagtttcttt aataccactg gactataaaa 20160 
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taggactgac 
atataatttt 
ttacatagaa 
cattacatat 
tgtattgaca 
tcaaatgaat 
atgctacacc 
caggaatttt 
ggtcgttgtc 
gcaagtgtca 
aaaagatgaa 
tatttttaaa 
agtaacttac 
tgtgtctttg 
ttcaaactgg 
caaatagtta 
tgagttttgt 
attaggtggc 
catcctattt 
cctatttctc 
aaaggagggg 
ttaggaggct 
gccagtgctt 
tcacagagaa 
catatttagg 
ttttaattta 
gctttaagag 
ggagcaaagt 
tcgagttcaa 
tccctgggtg 
gaaaataatt 
ctttagaaac 
tcattacaaa 
agtgatgtcc 
cagctagaat 
tgactaagct 
tgcatgagat 
cgcgttagcc 
ggtttagaca 
atggatgaac 
ttctctgcca 
caacactttt 
tgatctctaa 
taaacttaaa 
gaatttatat 
tttaggaatc 
ggtctctcta 
aaatcagatt 
ggaagttttt 
gtattctcat 
gaaaattcct 
aacattctta 
ttgtattgtt 
ccagagcttt 
atagaagcct 
ggtgtgcaca 
acacattaca 
attacagata 
tagtgctatc 
tgtgaaaaca 
ttcgtgttat 



gaaaagaccc 
ataacataca 
aacatagtta 
ccctaaaaca 
ctaaagagta 
gcatgtcaaa 
caattcaagg 
gaaatgaatg 
caggaatcag 
tttaaactgc 
aacagttaac 
tttgcttatt 
ctgatagtga 
tccatctcaa 
attaatggta 
tgaacaatta 
tcacagagtc 
ctatgtccac 
ggcaagtgct 
tctcttcttt 
aagaatttgg 
gaattaccaa 
cacaaatgct 
aacaagtaca 
ctttacagtt 
gscttttcct 
gtagaaaaat 
cagtccacaa 
ctccaaacat 
aaaagcaagc 
ttcaaatcta 
atcttcttgt 
ctccaactta 
acatatgtag 
ggggtgaatt 
ttcaagtgtg 
gttggtcttc 
cttttctgtt 
attgaccagc 
cctgagagga 
ttatggactt 
attaattgct 
tttcttgaga 
tactgtatat 
aaggccatcc 
ccacagaccc 
gtttttattt 
tacgatctaa 
agtttcttat 
tataaattaa 
gaaataagta 
ttaagagttt 
aaatgaagcc 
gctcctccaa 
aaaacatttt 
aataaatact 
tattaatgct 
caatctcaaa 
aaaaccatac 
ggattatatt 
tcatgttaaa 



attcaacatt 
tctatcatga 
cttacaaagt 
tatttaagaa 
actattttaa 
tattttattt 
tagaattcaa 
tacaaaatga 
tagttatttc 
aatcagttat 
ctgaaagttt 
tttgtcttgt 
taaatcaatt 
acaaccccac 
tccatgttac 
atggaacaca 
tatactcttt 
aatccatggc 
tcagtgatat 
ccttcccttt 
caatgtgtca 
gactggctgt 
gtctcaggtg 
ggaaaaacag 
ttcgtgctcc 
aaaaaaaata 
gaggcaagtg 
agtttgtctg 
tagcagtgca 
catttaatag 
ctatcctaag 
aattatttct 
gcaaaacgct 
ctctctgatg 
ttatatctcc 
actcaatgat 
agtgttgcca 
agataatttt 
aaactatttt 
gacaggaaaa 
ataacataat 
tcttttttta 
aagtctgtct 
tcatatttaa 
tctgaatcca 
atcagtaatc 
attcttgtag 
gttgagttaa 
atatggcaat 
gtacatttct 
tcaaaatgca 
tctttattga 
aatgaaattg 
attctgtcgt 
aaaactacat 
ctgtcacaca 
tcttctagga 
gcttgtcata 
catcaggtta 
cctactattg 
tgttaatctg 



ccattgaagg 
aatccattga 
catggttgaa 
ttatgaagag 
aaaggtaata 
aactctttaa 
aattcaaaat 
tgcaaaactg 
cagggacaaa 
ctacaattta 
ctcctgcact 
tcttttttaa 
tgtgccacat 
atgcctacct 
atgactttat 
cacacacacc 
tctattgtgt 
ataaacgccg 
ctgtagcctg 
tcttttgaat 
gttaatttca 
ttccattgat 
tgactgaagt 
cactattcca 
ttcatattct 
gtttgcttta 
aaaatgaatt 
tgacatcatt 
gacgggaaaa 
gaatgaaaga 
gagaacactg 
taaaatatcc 
ttttggaaaa 
atttgacttc 
caaacagttt 
agtaaagggg 
gttaactgag 
attattcgca 
tttttgaaaa 
cagcacctaa 
aaacatttag 
caaatttgat 
tactttaata 
atctgagatt 
tattcaatta 
aggcaagtac 
tattagcaga 
atctcagaga 
attttgtaag 
cccttaagct 
agattgtaaa 
taagttatac 
atcaagggtt 
gttagaattt 
aatccgtttt 
aacatgcaca 
cagagattaa 
tttctacaag 
aagaagtggg 
cctggtgaca 
ttaatttaaa 



attaaatgct 
ctcttcacaa 
atgaatctct 
gagcattatt 
tcatgatact 
ctaatgaaga 
aaacacatat 
gctttttcca 
attcatttgc 
tggaatcgta 
aaatgagaaa 
tttcacaata 
ctcatggcat 
caacatgagt 
acgtgttctt 
cagataggaa 
catataatgg 
gtaaatttac 
aagagaaaag 
ttcctaaaac 
tcaatgtaga 
gaaattaaat 
tgtgattgat 
aaatgcagaa 
tactttgatg 
cagtttacaa 
aaaaaaaaaa 
gtccttccaa 
tgcagtgacc 
gtcctgctat 
tgcaagataa 
ttccagacaa 
aggaaagtgg 
atccaagaat 
ttcataaata 
agaacctaat 
aaactatgtt 
attctctttg 
tgaccacaaa 
aaatagtaga 
ttggttcact 



tttattattt 
aaatgacata 
attgggacat 
aactaaaaaa 
ttaacagcca 
tgatttctat 
atcaaactcc 
atattttcat 
ttttaaaaaa 
atacgttttg 
ttctatgttt 
aggttttaga 
tattttttga 
taaaattaca 
cagagcataa 
taagtagtag 
gtaaatatat 
taaatattat 
tagtactcaa 
ttaaatcaaa 



atgttttaag 
ttagatgctc 
gctgtaacgc 
gaaaacaaaa 
catatgaata 
aatgggtatg 
atacataaat 
tggagattgg 
atttctatgg 
accactttaa 
gaagctcagt 
tttgactgac 
atcagctttc 
tttatctccc 
cactttacag 
tgggaccctc 
tgacaattca 
agattctccg 
ctaagttttt 
aaaggaagaa 
caaagttaaa 
caggtaaatt 
tggaaagtac 
caaaaccaca 
tatttctgcc 
agtaagatat 
aaagaatcca 
acaaggagct 
aaaatcagag 
aaatactaca 
tgaatatcat 
agcaatgaca 
aaggacattg 
cgtgtggaaa 
tcattttatt 
tcctgacaat 
aataatcagc 
atgggtatta 
catgttaaaa 
aggatattat 
tttttagtat 
tgatacctac 
cagtgcattt 
aatgatataa 
ggatgtccga 
tcctatcatt 
aatgttgatc 
attgtggaat 
aactaacaaa 
aatgatttta 
ataaatcaga 
ataaaatatg 
atatttctgt 
ctacaggagg 
catgtaaaat 
acacatgtac 
agaaaaaaga 
atctcagtga 
cagaatttaa 
cacatttttt 
ttaaataaaa 



20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
2364 0 
23700 
23760 
23820 
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tctgttaaag 
ctgatgctat 
ccctcaattt 
tgtataactt 
aaaaaacaac 
ttaatgagta 
ttaaaaatat 
ataatatgac 
ataaataaat 
gttaaagata 
ttatttcaat 
gcacttctag 
agctgtggcg 
aggcgggcgg 
gtctctacta 
tactccggag 
ccagatggcg 
ccaaaaaaac 
ttgggcactc 
tccgattttc 
attgtacctt 
attcattgca 
tattatgaaa 
ttatgaacta 
atcgaaatat 
accctcccaa 
ctggtggagg 
ccttcacatt 
aaaagtactt 
tcccaaagga 
ggccgctgcc 
aagacaaacc 
gtaagtgtca 
gcaaatacct 
tcccgaggta 
tcttggtttt 
taggaatgat 
acactttatc 
ttaggctgtc 
tttgaatcaa 
tgtagtactt 
tggtccttcc 
tcagacaact 
tctgctcaga 
ctcagtgtca 
ggcagatctc 
ggtcttatac 
ttccagatca 
ggacagctgt 
cttgtgagga 
gatgtcattg 
gaaaaccaga 
caatactctg 
agagaaaacc 
gccacaaaaa 
cccaccattc 
gaacactaca 
caaatggaag 
ctcaccctct 
ctccttgaat 
cctggaggtt 



aaaattgttc 
attttaacct 
tagtaaaaaa 
acttattagt 
agaaatgagt 
caaatactca 
tttaatgtat 
aatgatatat 
aaataaaaat 
aggcttggga 
taaactagtt 
cttctctttt 
aggccgggcg 
ctcacgaggt 
aaaatacgaa 
gctgaggcag 
ccactgccct 
aaaaaaaaaa 
tttacttcct 
tttgatttgt 
tgtctgtcaa 
aaaacttgat 
taaatttagt 
aaaagaaaat 
ttaaaaacat 
tttgctgata 
acaacacccc 
ttaatcatta 
tacaatgtat 
ttgttatatt 
tctgtctctc 
tgtgtaaact 
taatctgttc 
ttgtctttca 
tatttctgga 
caatgcttaa 
tttaatatca 
aaccaaagtc 
catttctcta 
ttgatccatg 
ttcattttta 
cccaggacag 
ttgtccatgt 
ccagtgcacg 
taaaaaggag 
tctgattctg 
ttttagagcc 
acggccaaat 
ttgctgtaat 
tctactaaag 
gtttatacgt 
cttctctaaa 
acaccatctg 
ccacgggatg 
gagtgccagc 
ccacccctcc 
cttactgatt 
tggtgcacag 
ctgggggcac 
ctcttttngc 

agggggtata 



aaatagcttc 
acagcttaca 
gaaccctact 
cttaactttg 
actcatgata 
ttcagctctc 
ttatttgtga 
gagacacaat 
aacataacct 
actttgccag 
gctttcagtt 
caaaatgagt 
cggtggctca 
caggagatcg 
aaattagccg 
gagaatggcg 
ccagcctggg 
acaaaagctg 
tgactctact 
gctattgttt 
aatttagaaa 
atcctaagct 
aagatctatc 
tctcattgaa 
gttggcaatg 
tgggccctgc 
tgtttctccg 
gcttatcttc 
tgcatttact 
tttctatttt 
ctgccttccc 
gaagcaccac 
ccccttgtac 
ccaagaaatg 
tttgatttac 
agtcccaaat 
attcccgaga 
accttcacaa 
atgcacattt 
ttttgcccac 
aaaacaatta 
tgtgtccagg 
ccctttgcag 
ctgaagctaa 
attccattat 
tattttaaat 
aaaagagaac 
agttacgtgg 
atgggaatag 
aatacccata 
gtatcctgat 
agctcttgta 
tgtggacagc 
agggcccaat 
cttcctggtc 
aaatgtgaca 
ttttnnggtt 
gacgggctgg 
caccctccca 
tcacgagttt 
gctgaagatt 



tccctaaagt 
aacaaatttg 
tttataactt 
ggagagaaga 
aatagtatat 
aaagtaactc 
gttacacaaa 
ttattttaag 
aagtatacct 
cacagtgtgg 
taaaaagtag 
tcaggccttc 
cgtctgtaat 
agatcatcct 
ggcgcggtgg 
tgaacccggg 
gaacagagag 
cagcgaggcc 
cctgcactct 
ttcttatagt 
aatgaaaaat 
agtgtgtcaa 
accaatatat 
ttgtttttta 
atgagtctat 
ctggttttgg 
gacctgagag 
aaactttttt 
atgtttgaat 
agtaagttta 
cactgaatct 
aaaactactt 
tgtagcataa 
gaaattgttt 
tgaattcaaa 
aacttttctt 
ggtgtagaag 
gcatcctcag 
gtggttgtca 
cagaacaaca 
ctagaatccc 
tggactgccc 
aggccctggg 
tgagtcattt 
ttctctttca 
tgatttttga 
gttcacatca 
gcttgaacaa 
gaatgtcatc 
aaagtctttt 
tccactcctt 
ttattatttt 
agttcaattg 
gccacaagat 
tcctgccagg 
actttttaga 
tnnattatga 
gtatggggta 
gcacatccat 
ttataaccca 
tccaacgcct 



tttctttagt 
ggtcttttaa 
acattaaaca 
ggtttaagta 
gcacctaaaa 
tcccaactga 
tccattagtt 
aatatttata 
tagaaaaact 
gtatcatagt 
atgattttac 
ccatcctgac 
cccagcactt 
ggctaacatg 
cgggcacctg 
aggcggagct 
agactccgtc 
gtttctgaag 
gagactgacc 
taagggaaat 
agaccaaagg 
aagtgtacaa 
aagaaatagg 
atcaaattaa 
aaagcaattt 
gggccttgaa 
tttttctttt 
aagaatgatc 
aaaacattat 
ttgaagacaa 
ccagcccaca 
ttatgaaagc 
actttatcga 
ggtgtaattg 
gttgaatgat 
aaaatcagag 
tagctactgt 
gatgaagact 
gaaaattttt 
gaaaccaaat 
ctgagtcttc 
ttccaaccct 
cagagattcc 
tatatggact 
aatacagccc 
atttaaattg 
actggagatt 
ctgtcccagt 
tccccctgcc 
aaaaaggtta 
ccatatcatt 
ttttttttcc 
gattctaact 
tgcccccata 
cccactacaa 
acaactcata 
aggacagttc 
ggggaatgac 
gtgtttngcc 
atctctaata 
aatcaccatt 



tcaaaagatt 
aacttccatt 
ttggaaataa 
cgtattttaa 
tatactaaaa 
ataaacacgt 
taatatttag 
taataaataa 
atggggaaat 
gtaggtattc 
atacaagttt 
accttgaaaa 
tgggaggctg 
gtgaaacccc 
tggtcccagc 
tgcagtgagc 
tcaaaaaaaa 
atggcgggta 
gtataacatt 
gtgaaatatt 
tgggagtata 
tttcgagcat 
atacaaagtt 
atgaagctac 
agtgggtata 
tctgcaatcc 
aatctaattt 
ttaacaaatt 
tttgccacta 
acaggccttg 
ttcacaaaca 
agaatggaag 
actcactgca 
tgccttacct 
tcaccaaact 
ctgccaaagt 
gctatttctc 
tcaataactc 
tttatattga 
gtattctttt 
ttttgttctg 
ctcctctctt 
cgccgggagg 
tggtgatgat 
tagctgctga 
agagatttca 
agggttaatg 
tggcctcctg 
ttacaggact 
gagaacagaa 
gtctcttcgt 
aaacaccaac 
aaccagagtt 
tcacatgcca 
acacagggtt 
gaactcagga 
aggaccagtn 
tgggagcgtc 
agcttggaag 
ccctcctttc 
tggtctttgt 



23880 
23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
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tgtgaccagc cctatcctga agctatacag agggccgcac cctaggttac ctcagtggca 27540 
taaacacagg tgtagtagaa aggggcttat tatgaataac agaaggcact cccatttctg 27600 
ctagtcccga gggttttggt agttctgtgc caagaacagg ggaaaaagat gaagtacaat 27660 
ttttattaca ccactgctgt ccatactggt taccagcatt ttctttggtc tctcctcaaa 27720 
tgacaccact ctccgctgcc ctgtcacttc cctgaacagt tctcttggtg attttctctc 27780 
ctcgtcctac tttccctttc atgagcacaa gaagtagtca agcggtgaac cctttatcgt 27840 
tcttgaagca cctctcactt tggagacttt aataccacaa cctgcttgtt ttccttccat 27900 
gcttctgact cttcctgctt tgtctccttg gagatatatt agtgagagaa gtaaaatgtt 27960 
gcaactggtt taaatgataa tttgatatac agatatatgc attctccaag gcacagaatg 28020 
tatttatatt gctaagaaat gtgctaaaaa gtggagacaa ggcctgaaga atccctgagc 28080 
agacaacact agtcaggcct cataagtgac ctcaaccttg tttgatttgc aaacataagt 28140 
gaatgtcagc ttgagctatt tcttgtaaat gcctgtattc aaaacagaac ttaagcttaa 28200 
ccagtcagaa gcagtctact cacttataat cctataactg gggggacttt ccgaaggnat 28260 
agatcaaata aggcaactgt ataactgaaa ccaaccaaat gctttctctg atttacttct 28320 
gtgttgttct ataaaagcct cctctttctg tttcctcagt aaagctcctg aaccagttct 28380 
actttggagg tgcccaattc atgaattgct tgctcaaata aactattctt ttaaaattta 28440 
ttgggcctcc atttactttc agataacagt aattatttgc atgcctatgg aaggaaacat 28500 
tggtattact gtcttttttt tttttttttt tacaatttaa gttctaactc tacagaattg 28560 
taaaataaca taagaattaa cccaaaatgt gcattcctat caagtcagat aatcacccag 28620 
aaatcctgct aaaataattt ccagaatttc actgaaagga cacctagtca tttcttttgc 28680 
ccatttcctg acagtatctc cactagtaat aaaaataata ttgaaagaaa attcttaatc 28740 
tcaaacaggc caaggatgcc cataaggttt gacctgatat ttacatagtt gcagaagaat 28800 
ttaaactaac catatgggca tctttgagtt tattctgtag ttcttaatca atgaattatt 28860 
aagcaactac taagaccaga gaataaactt atgactagga gttttcaaaa gaaatctttg 28920 
attggtcttc taaaaatccc tatgattttc ttttctgtat tagttcattt tcacactgct 28980 
ataaagaact acctgagact gagtaattta tgaagaatag aggtttaatt gactcacagt 29040 
tctgcaggct taacaggaag catgacttag gggtctcagg aaacttacaa tcatgccaga 29100 
aggctaaggg gaagcaagca catcttacca tggcagagca ggagacagag cgagtgaaca 29160 
aagggggaag tgccacacat tttcaaacca ccagatctct tgagaactca ctcactatca 29220 
caagaataac aagggagaag tctgcctcca tgattcaatc acctcccacc aggccccttt 29280 
cccgacacat ggggattaca gttcgagatg agatttgggt ggggacacag agccaaacca 29340 
taacatattc taagacgagg aaactgacca caagaaactt tctccagcag atttcacctg 29400 
catttcctac attcaactga atcctttaaa ttgactgagg cacatcctgt accaaggaga 29460 
gacctttgtt actgcctttg atcctgagac cttctgagcc aggtacttgg gtgcttttcc 29520 
tggggaggct ttgaggcgtt tactccacac ataatgtcaa acgtgtttta taatagtggg 29580 
atagtcagac ttgattaaat gagaattatt ttcaaatatg atactaagta aggtcttgag 29640 
tgtagaatca aattgtccaa ttatatccta ataaaaagga agacagattc ttagtggacc 29700 
ttttcaacta tctatacagc agtttcataa aacattagga aatttagtaa accttattac 29760 
ttctggggag tcagtgagaa tcagatacat ttcttaaatg tttcatttca tcaagagcag 29820 
agtctatgaa attaagggta agatttgctt caaatcttaa gaagaaagca ctttgatata 29880 
tatcttttaa aactaaaaac aacattgatg gtattctaca atacttgcat tttcctctca 29940 
cttatgtagt tcattcctgt tctcaggaac cctgctttta agaggaaggt cttggccagg 30000 
ctcagtggct cacacctgta atcccagcac tttgggaggc cgaggcgggc agatcacctg 30060 
aggtcaggag ttcaagacca gcctgaccaa tatggtgaaa ccccatctct actaaaaata 30120 
caaaaattag ccaggcatgg tggcaggtgc ctgtagtccc agctactcgg gaggctgaga 30180 
caggagaatt gcttgaacct gggaggcgga ggttgcagtg agccaagatc acaccactgc 30240 
actccagcct gggcaacaga gtgagactcc atctcaaatt aagtaagtaa ataaataaat 30300 
aaataaataa ataaataaat acagtgtcct ggaaagcctg actcggtaga tggttaccct 30360 
ctgagagttg gttaagtaat atcaccttag caggtgatat tacataagag accattctgg 30420 
aagtataaaa agttaaagtt gctggtacaa acatttccac aaggctctaa gagccctctt 30480 
ttactgaaga caaactgtgg tgtgaatctt acatcagtat ccaacatcag agtaacgaga 30540 
aactacctgt agatgacaaa gactgaatgg ctgtggccaa tttattataa taaccagtga 30600 
tgtcagaatg gagaagaggg gaattctcta tggggtgtag taggcaacaa ttccataaga 30660 
gtttggcaat gtttctcctg tgctgggaat tatcatgtac agccagagta ctggcaaatc 30720 
ttctggacct tcccatgaat tatataattt ctaaaatatt tatattaata atttatatta 30780 
ataaatgaaa ttgtgtcctt gaagttcata catcaaattc ctaaccacga ataggactct 30840 
atttggagat aagggctttg tggaggtaat gaagtttgaa tgagatcgta agggcagggt 30900 
actgtgatca taagaacccc aatatgactg gtgtccttat aagaagaggg agagacacaa 30960 
ggagcatgct tacaagaaaa ggcaatgtga ggacatagtg agaaggtggc catctgcaaa 31020 
tcaaggagag agaccttaca agaaacctac cttgccagca ccgtggtctt ggatttccag 31080 
cccctagaac tgtgagaaaa taggtttatg ttgttcaagc cacccagaca gtgttatttt 31140 
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attatggcag ccaaagagac taatacaata atatttttct gtttaaattt tatctagcaa 31200 
tagctgagga tctttcccaa tttaataatg cttgctatgc agctctcacc atagtgtcga 31260 
ggtaatttaa gaaatatagt gcacatcaca aaacataatt atttgtagca tcactctttt 31320 
cttaaggtga aagaataaat ttttatgaaa tctcaaaaga gatgtagaca tttgattcat 31380 
ttttcttaat ttaggataca atcttagaaa gccaaaatct aaataattgt tggtggagat 31440 
ttgggcactt atttaggata agatcatgga tacctaaaaa acagtggttg gttaattcat 31500 
caaaatgaca tcaaaatatt ttaaagtaaa cttagaagtt tccacaatta taaatattct 31560 
tagctctttc caaagctaga aaatgtttta tttttagtaa tcaaagatat atgaaagtaa 31620 
acacagaaaa attattcaag tgggatacag aatctctatc acctattatc agaattttga 31680 
ttacacaaaa gattaaaaaa tgacttttca cattgtaggc aagtcacgca tcattaaacc 31740 

aagggaacaa gcacatcaaa aacagagtga aatttgccaa aatataaaat attttagttt 31800 
tgttttagat taatatgtta ttaacaaaga caaaattcaa tttctaagtt ttgtttttca 31860 
ctatactctt tcatattctg gaaagatctg tcacttttct aaaatacaaa actaattctt 31920 

ttcctcgaaa gacaaaaata tacacacttg tgtttccctg actctattat ccttagtaga 31980 

gtctcaatca cccatttgat taagcttatt tcatagaaaa aactgggtta atgggggaat 32040 

aattaagaac ggatgtcata cagtatttac tgtattacca aaggtcacag gaaacacacc 32100 

tggtggccgt gtcatccatc ctttacatct gcttgccatg gcaaaaataa atatactcat 32160 

tgatatagat gtatcccaag agatagtctt ctggacagca tataaaaata acaagcacaa 32220 

gtatgtaaac ttaaatttat atttagtagt caaggactca actgttatac attcattaga 32280 

agttactttg ttttctaaga attctccaat tttaagggta tttaacatta gtcaaggtct 32340 

taagtaacct aaggaagtta gaaactaagt taaagctgtc acactgcaac acatagttac 32400 

tattgacaga aaaggtttat cataataata atttaattta atgcaatttt tcatatttta 32460 

cacattgtgc agaaataatg ttagttttta tgatcgatag accactataa attcagaaat 32520 

tgcagtttga ctcttatcat catgagaaaa gcaaccaaaa tagagtgaat tgtaggtcaa 32580 

attatattta gtgtggcttt atcagaaaaa acatatagac tttttaataa atgaaattat 32640 

ttatataatt tgcctaataa tggaattaga taaacttgga atttaaaaat gcttctgaac 32700 

taaaggtttt ataagaacat ttttaatgtt ttccaacttt attggtttat aattaacata 32760 

aaaaatgtat atgttgggtt gatgcaaaac agcaattact tttacacaaa ccaaatattt 32820 

aatgtacaac ttgatgttgt gacatacata tacactgtga cgtgacaatc acaaacaact 32880 

tagtgtatgt atctattaca taccatggtt atctttgtgt gtttgtggtg aaaactctta 32940 

atatctatcc acttagcaaa ttttaagtat acaatacact attaaagatg atatttttaa 33000 

tctagtgtat ttaaagtatt agtttatttt tctttttttt gaatttggaa aaattggatt 33060 

tatatgtaca tttatcttta aaagccaatt agaatagtac ctctttaatt aaagagatag 33120 

gaacatttat atccactaac tgaggtcaag acctttcaca attacagagt cacaaacgtg 33180 

ttagagaact cacagcctga tatcttccac ttcagccacg agtcaagaat acacacaaag 33240 

atgcaaaaat ccaatggctg agatctcaaa gagaagttca gtttcccaat gtgcataaaa 33300 

ttctcaattg atttgagctc acaagtaaaa caaaatagac aataagaata ataagccgat 33360 

tacgtcatct tttattcaac agacagtaga cctacagaga tcatcgaatt ggcagatcat 33420 

aaagccaaat tcctaattgc ttcctccagt ggaaaataag ccatcagcta tgtcttactc 33480 

agcaacaaaa caaaatatat agttagtagg aaacaaagtt aaagaaacag agtgaacaag 33540 

gttaaatatc tattgccatt tgataccctc tctgggaggc aggaaaattg ttcatgggct 33600 

taagcaattc aggaggtcca tttcttcagc cccacagttt acctgactct aagttgatga 33660 

actagacatt ttggcagata atctacctct aaacccatta acacataaca ttcccacagg 33720 

taaaatcaca attctcatgg aaatatcaaa taaacatatg tcaaatgtta tttaactcat 33780 

cagtttaacg aaagaaccag taagatgttg aaatcagttc tcagaaattt gttggttttt 33840 

ttttcaacca agaactcccc actgacacca accctatcaa aaattgccat ttgacatttc 33900 

atctggatgt tttaaaggcc tctcaattca ctgtgtccaa aaaggaattc atgactttct 33960 

tttccaaatc tctttccttc tcttggggtg ttcactactt ctggcaatga tcccactttg 34020 

tgttgaatta tttacaccag aaatctaaga ctaataaatg atactttatc tcactacact 34080 

ttcacctgcc attacatgtt tattaccaaa ctctgttgat tttactttct aaatatctct 34140 

tacctgtatc tgcttctaac ttaaccacgt tcatccatac catgacacca ccatcaccat 34200 

ttcatctgga ccatatcact tagactaaac cattagccat atgtttttct cattggtttc 34260 

actacctcaa gcatcttctt atccctccag ttttgtccat atatcagcca gagtgataga 34320 

tatgataatg tattctacta ttagaaatcc gtaatgattt cctattgctc ttgggataaa 34380 

aaacaaaaca ctatgaagcc ccattgatct taggataggg acttaaactt tcttattctc 34440 

caccattttt gtgatagaga ctcaaacact gtgatcctcc ctcctcttat aataaagacc 34500 

caaaccttat aacacttcat ctctcttagg ataaagacca aaatctatga taccctgctg 34560 

ctctcaggat agagacccaa gccctctatc tggagtacaa ggctctctta caatgctgct 34 620 

agttacccta acttcctctc ttcaatattg tcatgaacca ggattcccct cactttctgg 34 680 

ctcagccttc cttaatttct tcataggttt ctcaacttct catcactgca tcttagaagt 34740 

gtataattta tttatttatt tatttttact ttcaatattt ttctcttaaa tctcctactt 34800 
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tgctgcctgg ttcaaaccta tgattatcca agtttatctc atttttattt tgtcagggcc 34860 
ttttccttga ctgttcaggc aagaccaagt tatacccctc tcctgatagc acttaaaaaa 34920 
cattgtaatc ttggccaggt gtggtggctc acgcctgtaa tcccagcact ttgggagacc 34980 
gaggcaggtg tatcacgagg tcaggagttc cagaccagcc tggccaatgt ggtgaaaccc 35040 
cgcctttact aaaactatga aaattagcca ggcacggtgg ctcatgcctg taatcctagc 35100 
tactcgggtg gctgagacag aagaattgct tgaatctggg aggcagaggt tgcagtgagt 35160 
tgagctgaga tcacatcatt gcactccagc ctgggcgaca gagcgtgact ccgtctcaaa 35220 
aaaaaaaaaa aaaaaattgt aatcttacat tgaattttgt ggttcagttt ttttctctac 35280 
cactgcacag ttaatttcaa gagggcaggg ccatgtctaa ttctacttat tatattccta 35340 
ctgaatatta tgtagttgac atatattaaa taattttaat ttaaataaaa tgtatttatt 35400 
gagtgaatgt aggacttgct gtcatataac atatttctaa gtctgattct atcttctact 35460 
atactagata attagttcca aattttatgt atatacgtta taatgtgatt agactcccat 35520 
aaaaaagtat taaataataa agagagttaa tgtgtcatat cacaagaagg ccagaagtag 35580 
gaaagttgta gtagtttagc ctctgaatca gatggggtta taaattaact tcttttactc 35640 
accagctgtg tgagcacagt tatatattta cagatcactg aaacattttt ttgaaattca 35700 
gtctgctcaa gaaaaaaaac aggaatgaga atattacctt tcacacaggg attctttcaa 35760 
aattactgaa ttaatgcatt caaagaattc tttcagtact gttcaagcac ttcaatatta 35820 
ccaatgacag tttcctgcag tatcttggat attattcacg tggacctata gagtgttagc 35880 
catttaaaag tcatcagctg cttttgctgt ctccatgttt attgcagcat ttaggactag 35940 
gttcaactac tcataacaga aaatgttact gtagtggctt aaacacaaat aatttatctt 36000 
gtgcaaaaca ttcaggggtg agcatttggg gagaataaca ccaagaagcc agaccctttg 36060 
taacttcctg cagtggtttt agcaccttca ggcgggaaag tcaaagaaga gaggagtgac 36120 
attgggagac tttgatgtac gtatcactgc ccagaagcaa tcatttctaa cagtgaaaaa 36180 
ggaggggaaa tgaaggtttt tgcctccaca atctctatag tagaagcaga caaggcaata 36240 
ggggaaggga aaaaagtttg gtccatcata tatcctaggt gtaggtattt gagggtgatg 36300 
accataccag ttaattaggg ggagatgagt agatcctcat ctcttccaag tataggaagc 36360 
taactttgcc gtaattgtta ctaatacacc cttcaaaatc accattttga aaggttttgt 36420 
gaaatgagca ttattttctt ttaatacaag gtcagacctt cagaatgatg aataaactca 36480 
tttgagggca agtaccaatt cagagctgct gtaaagtagg cattttcaag aagaaggaat 36540 
acttctcatg tttgaactgt gtaagcaagt ctttatttat tttttgaatc ctgaaacagc 36600 
aattagaaaa gagagaggaa gatatttcct tttctgaagg ctgccaagaa taaatcctat 36660 
tctcatttgt taaaataaaa agaggtaata agctaaagta agaatacact aaccttattc 36720 
ttgaacccag ccgcctctag aaaaaaacac tataaaagtg aagctaagtt cagtaatttt 36780 
tgcagtcatt gcctttttgt tagctctagt tatctttcaa gttttattga taataaaaca 36840 
ttaattgcaa ggacaacttt tgaaatctat tattgaatac ctaggacact catgatgtct 36900 
ctccacattc ttgccaagga attgctttat gaattaaacc acttggggaa ttaaaccact 36960 
atcaattaaa catacagttg tataaggata gacatatact actttacatg caaaaattta 37020 
ggccatagca atttagtaac taccacgtta cagggactgg aacttttcac tcctccactt 37080 
cttctatctc accactctgt tatatgaagc tatgattctc cattcagaat ataactacat 37140 
tagtttttga aataggaaaa tatgaaggac agactctaga aatatcagaa tcacttggac 37200 
aacaaagtcc aaaatatgta ttatgaaagt gaattatcat gtttgactca atatgttgct 37260 
gcaaataaat aaaaattatt tgctcaatat agatactcct tatatgcctc aatattttat 37320 
atacactgac tgaatattta tgactatgta tgagcagaat aatgagaaaa aaacttacta 37380 
aaaaatctaa ttattttgtt ccttttctta ataaatgtcc cgtttcacag ttataacact 37440 
ttctggattt ttatttttat tctgacttta taaaataaac attattttta gaatagtttt 37500 
aaattaacaa aaaaacacac agagattaca gaatgctcca tatatgttat acatatatat 37560 
atatattata tatacacaca cacacatata tttaaatacg tatattatat atatatttta 37620 
tatatcccgt cccctcccct agaaatcact gaattgttta ctggctctaa agttttgctc 37680 
tcattattaa atatgtaaca aatatatatt atatatttat atacaatgta tataaaataa 37740 
atatatatac acatgtaaaa aaatatatat atattttgcc tgctccagaa ttccattcca 37800 
tccaggatac cacagaactt tcactaatct tgtctctttg ggtgcttctt ggctataaca 37860 
gtttcttagg attttcttat ttttgatgac cttcacagtt ttgaggagtg ctggtcagat 37920 
attttgtaaa atgttccttt cttgtaattt ttctgatttt ttaccatgat tagactggac 37980 
ttagggcttg tggggatgga ctgtcacaga ggtaaaatac attctcatca cagcatatca 38040 
agggtttata ctattgacat gacttaaaac tattgatgtt aacactgatc acctggttga 38100 
ggtagtattt gtcaggattc tccactgtaa agttaatttt ttccctgttt ctatattgta 38160 
ttttagctat atcaataaga actcatggac atttctcttg tactttgggt tataatccaa 38220 
tcttttttat ttttgctcaa attgttccag ctttggtcat cggacacttt aagtagctgc 38280 
tgtgtgcttt tgatgttatt gcagcttggt ctgtttagca ctttcttact ttctaaaatg 38340 
tcaagatgct ccaggcttat cttgtatatt accttatgta gctccaggat taaccatttc 38400 
tccaaggttt tgacaaactc atagtgtcat gtattcacca ttacagtatc atacagaata 38460 
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attttacagc ccgagaacag ccctgtgatt cacccagtca acctttccct cccttagaaa 38520 
ttactgattg cttactggct ctaaagtttt gctctttaca gaatatcaca taaatgaact 38580 
catgggtatg tagcttttac agactgactt ccttcattta gtaatatgga tttaagattc 38640 
atccatgtct ttgtgtatgt tgatagccca tttttaaatc actgaataat atttcatcac 38700 
atggatatac catggtttat ttgttcattc acctattgaa aggcatcttg attgtttcca 38760 
atttgggaca attataaata aagttgccac atgtataaat atgcatgctg ctttttgtgt 38820 
ggatataaat gttcaaataa gttagttaaa cacctggtag tctgattgct ggattatatg 38880 
ataatgctac atttttcctt gttaagaaac tgacaaattg tcttctaaag cagttgtgcc 38940 
attttgagtt ctgatcagca gtggatgaaa gttcccatta ctccatttca ttgtcaacaa 39000 
ttattttcat tatgttgttt tctttttggt gagttttaaa atggctttgt atattttata 39060 
tataagtaat ttattagata tgtgttttgc aaatatctcc ttttagtctg tggattgtct 39120 
ttttattctt tgaacaattt attttataaa acataagttt ttaattttga taaagtccaa 39180 
cacatctatt tgtttaattg attgtgctat tggtgttatg tttaaaaact catcaccaaa 39240 
tcctgggtca gacagatttt ctattatgtt tttgtctaga aatttcatca tttgtatttt 39300 
acatacattt gtgatccatt ttgatttaat tttgtgtaag gtataaggtc tttttctaag 39360 
tttatttatt tatttatttt tgtatgtgga tgttcaatta atccagaatc atttgttgaa 39420 
aaaactattc tctttttctt tcaacctttt tgaccctttt gtcaaagatt aaatgatggt 39480 
atttctgtac aataatcata tgatgtggct gtgtcccaac ccaaatctca tcttgaattg 39540 
tagttcccat gattcccacg tgttgtggga gggaccctgt gaaagataat tgaatcatgg 39600 
gtgaggtttt ccctatactg tcctgtggta gtgaataagt ctcatgagat ctgatggttt 39660 
tataaggggc acctctttca ctcggttccc attttctctt ttgctgccgc catgtgggac 39720 
atgccttttt ccttccacca ggattgtggg cctccccagc cacgtggaac tgtgagtcca 39780 
ttgaacatct ttttctttgt agattgccca gtttctatca ggagcacaaa ataaactaat 39840 
accgatgata tctgtctttt ctgttctgtt ccattgatct aagtgtttat tgattggcta 39900 
atatcatgct atcttgatta ctgtagcttt gcagtaaatc atgaaggcta ataggacaag 39960 
tcttttgact ttcttcttat tcttttttac tactgtattt gtttttccag gtcttgtgcc 40020 
atttgtatga aatttagaat cagtttgttt ccatctacaa aatagcttgc cacgattttc 40080 
tttttgtggt tacattgaat ctacagatta ggttgaagga aattgacatc tcaacaatgt 40140 
tatgtcttct aatgcataaa catgaaatat tgctccattt attgagatct ttgatttttt 40200 
catcaatgtt ttatagtttt caacatacaa atcctgtgca tattttgtta gatttatatt 40260 
tgagtatatc atttattgga tgctattgta aattataagg ttttttttac attttaattc 40320 
caaatctttg tctttcatgt atgagaaagc catggacttt ttatattgcc ctgatatcct 40380 
gtaaccttgc tgtattcact tattattcca cctaagcgga tctcagctgc atctgctgga 40440 
tgaggtggtg ctttatccta taacttctat tctctgatgc atccaataaa attcattgat 40500 
tttcactttg tcctgctttt ccttgtaaga acaggaataa tgaagtccaa gctttcacac 40560 
ataggagata aaactgaaag tccacacatt ctgattttaa aaaatcttct attcttttgt 40620 
tttgctattc atatgtgttt tttaaatgtt tatgcttgag gtttgaattg tttatgtaat 40680 
gtagaatcca cagataaaca caattagcaa atgtaagcta taattcttac agttttaaag 40740 
gtacctttgt atgtacccca caaaaaatag atgtcaacaa tttggaggct actttttatt 40800 
taagtaattt tacacctgct ttctcttttt tttaatgaaa cccagttgca gttctttgaa 40860 
caccaggaaa aacttcctag caacctctga tgtattcctg catctagagt agaaacacag 40920 
cacacagcac agtagagaat tgaaaaattt gagaaaaaga gtcagttgtt ggccacattg 40980 
actaagactc aggcacgcac atacacatac caaagtgatt agcaccttag ttgagctcaa 41040 
ataaggagga aaaaaaccag cattacaaga gtgagacagt ggcaggttac tactagccct 41100 
gcctgtggta tttctcttcc atttgaccag aaatggcaaa ttctgttgaa aagaatggcc 41160 
cattttcagg caaagcttta ggaagcacat gatttccaca tagaatctgg cctttgtcag 41220 
acatggtatg ggactgtaaa aatgacttat tccaagatga gaacacatgg ggtagagctt 41280 
taaaacctgc aagccatgcg ttgttctctt attgaagttt tgttaaagct gttccagaca 41340 
tttgctgcct ttgcttccat ttttggactt gagagagaag aggcatcaag acgtctgcag 41400 
gggggacatg aataagtgtg tgtgcacatg cgcacctgtg tggatatgtc aaaataataa 41460 
agtttaataa ttgatttcta taattattaa atatagcttt actaattgat agctataatt 41520 
attaaacata gctataatta ttaaatatta attattttat atgtttagaa aaaatcctat 41580 
acctttgata atattcatgt gtgcaaaata tattcttctg tcctggtaac tgctctgttc 41640 
ataatcatct ttttttggac aaacaaagaa aacatataga caaacaggat taatataaaa 41700 
tttaacaatt atatagcaat acagtggcta actatagcta ctctaattaa ttacctgttc 41760 
tagagtgatt ggactatcgc aggtataaac cctagcaggg ataaaatctt tttatttata 41820 
tgctatataa ataaagtatg ttctaaaaca cttgcatgat ttattccacg aatactgcat 41880 
ggatttcgtt gcacgtgaga atttctctgc ctgttcttcc tattaaccac taaatgttat 41940 
gtcatgctgg ggatgtgagc cctgaagcag tgacaggaaa tatctgcaaa atggcaaatg 42000 
cctcccatta ctgagagtgc tgattacaaa gctacaatta catgtggatt attcatgttt 42060 
gcagcaatat aatgaactgg tttatagttt ttcaagaaga aagcatctgt agtatagtaa 42120 
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acttgaaaga 
atttattttt 
atgaaaagtc 
gaggctatgc 
cacccctcgg 
attagcctgt 
ctcatttgca 
ccccagtatg 
gtacatgacc 
actacagggg 
agagcaggag 
tcaaatcata 
ttaacaaatt 
ataaatatat 
cagtggcttt 
tcataatttt 
ctcaggcctt 
ctgaatatag 
agtgttatca 
tagataaata 
ctgggcactt 
tgtacaagtg 
aattgctgga 
ttccttggct 
gggtcctgca 
agacagtaaa 
tattatataa 
atattaaaaa 
ggaagtaata 
ctgggataaa 
taaaaaatat 
actttgtaga 
tgaaggatat 
agagctacgt 
atcttttgaa 
cagaaaacta 
gctgccttta 
taacattcaa 
atggatatct 
ctaaatgtgc 
taaactttta 
ttccaattca 
caactagtat 
ccttcttgcc 
ggacatctca 
cattcactaa 
ctacatgttg 
gactgggcgc 
taggacgatg 
gggatttctg 
tagaaaaaaa 
ttatcaggga 
cctcattttg 
acatatccag 
cacatggtcg 
gtactcagaa 
atatcgagca 
gtaatcctta 
tcaatttaaa 
tatttttatt 
tacagaagtt 



agtgagtgag 
ttctttgaac 
tgtgaatcaa 
tggaccccgt 
catggggcaa 
gtctccttct 
agagcagatc 
ggcctctctc 
actgggaccc 
tgtgtgtgtg 



gctttcacaa 
ccaaaaggta 
gtttcctttt 
acaaaaccta 
aagttcattc 
aaacgaaaca 
ggtaaccatt 
tggacacatg 
agtttcatcc 
atatttcatt 
gacttgcttc 
tctgcttgag 
tcagacagaa 
ttcccttata 
aatattatta 
tcaatccttg 
atgagaaatc 
aaaaatccaa 
gaaatataaa 
aattaagcct 
tgtatgacta 
aattctttgt 
cagaaagaaa 
tagcagtgag 
gcttatacag 
gcatgagcta 
gggaaagctt 
ttgttttttc 
tccccttgct 
tggtgacaga 
tttcaaaaca 
taggtcattt 
ccatccttat 
ccgtaaaaaa 
cttctctttg 
ctaagtagtc 
agtaacttgc 
taacatgttt 
ttttgggcat 
aaaggtttaa 
aggaatcaaa 
acttcttaag 
gaaagtgttc 
tggtctttct 
gagatggctg 
ggcctcccca 
gccaccataa 
acctctgaga 
cctttaagct 
ggtggatcca 
cacattttag 



ctcagagacc 
tggtgaagtt 
tcacaggacc 
actgtagagg 
gaagtccaag 
gcacaatgct 
tcttgcttgg 
cagcaagggg 
tccaaggtgt 
tgtgtgtgtg 
tattccagat 
taaggagtta 
ttatataaag 
cataaaattt 
atgatgttgt 
tttgttccta 
attcttcctt 
cagtgttttc 
atgttgttgc 
gcatgtgtgt 
catgttttaa 
tccctgtttt 
actgtatgtt 
ctatttgcac 
gggacaggac 
taaatgttag 
tgggtttcca 
atcaagctgt 
aagaaaactt 
ttaaaatttt 
gcttcttttg 
cttaaatcat 
tgtgaagaaa 
acccaattta 
tgttctgatc 
aatgtcttta 
ttaatcagaa 
acactgctct 
tagcacggtt 
cagagaggga 
atggagagca 
ttttgaaatt 
ttctagaggt 
tattttgtaa 
ttaaaaattc 
ggaaactttt 
tcacggttac 
ttcccaagaa 
atgtttttcc 
attaaatagg 
tgctaattca 
gtcagagttt 
aggactttta 
taaagattat 
aggcttaaga 
cctctcttgc 
tatgtgtgtg 
ttctcactgg 
gttttgcttc 
tatgacaaag 
ctacccgata 



tttttttttt 
ctctctactc 
cagttctgcc 
atccccactc 
aacgccataa 
ttctgcggct 
attcatgttc 
atgaggatgt 
ggcaggctag 
tgtttgtgtg 
ggaagacagt 
agagaagaga 
taattttagg 
accattttca 
acaaccatcg 
ttaaacgcta 
ctgctttcct 
cctttatgtc 
gtgtatcagt 
accgcatatt 
ctacagtgaa 
caattttttg 
taacattttg 
atgtagcatg 
gtgacagcaa 
tctttcactt 
tcagtgttca 
tttgtggtat 
gatcttctga 
tgaggttcat 
gaaggtaaaa 
atagcttaaa 
gaagaaaagc 
aagagtttgg 
cctaacaacc 
tgtttgaatt 
tgtctcttca 
tgaaagagcc 
tatgtcaaga 
tgatcattca 
catcccatca 
gtattttctg 
atcaccgtgc 
gtattcgtat 
cttatgttta 
gagacagctt 
atgatagtta 
aaataaacct 
aagcatctgg 
atttcccttt 
gaaggtattg 
cactatcaca 
ggtgcagttc 
aaggacttag 
atgctgcaca 
acctgtgatc 
tgtgattaat 
atattaagga 
tgatactatt 
tgaaaaggag 
actgtaaaaa 



gtaggaagtc 
agtgcgactg 
ctgagacctc 
ctaccctctt 
gccaagggaa 
cttccttgca 
ttgtggtcac 
tgtttgatcg 
gcaggaatag 
tgtgtgtagg 
gaaatccgcc 
ataggatata 
tggtaaaata 
tcatttttca 
gtactatata 
acttcccaat 
gcatttgcct 
tagcttcttt 
acttcattca 
gcttatccat 
tatgctgcta 
gggtatataa 
aagaactgac 
cagatctcca 
ccttatggat 
tttttcactt 
aacttggaaa 
ttcagatctg 
ggttttcttt 
atatcaatgc 
aaactcagtt 
aaatgagcct 
acaaattaaa 
gtccagccag 
catattcttg 
agagaacagg 
tcctggcttt 
tgcggctcta 
ttttgccctt 
aaagtagtct 
ctgctacgaa 
attaaaaaca 
atttcgcacc 
gtacccacag 
cagcttttca 
tgcagaacaa 
tgtaagatgc 
ccattgcatc 
tagggacaga 
aaaattttga 
agaacaggaa 
ttctagcttc 
tggacacctc 
cagcaaatta 
gcatcctgag 
ctcatcttgc 
caattattac 
tcttaaaaat 
ccatgtctta 
gagggttttg 
caagggcaat 



tagatgagac 
gtgaggttgc 
aaggagagag 
gtggacaatg 
cgtacacaca 
gagtgtggag 
accacaccat 
tggggctaat 
cagagaattc 
taaggaatat 
aagcctaaat 
ccttacttgt 
tacagaaaat 
gtgtgaaatt 
cagaacattt 
cccctgtacc 
attctagatc 
cctgtagcac 
cataatttta 
tcattagttg 
tgaacattgt 
ttaagagtgg 
aagttatcgt 
ctgtagccct 
ctatacatgt 
ttgctttcca 
cttttgtgtc 
actgaaacca 
accaaagtaa 
cactgatttc 
ctcctacaag 
tttaaaactg 
agaacgaaca 
agcttttcaa 
aaaaattaac 
gaaaggtctt 
tgacctttcc 
agaaagaagg 
gtaaaagcac 
ctgcttttgt 
aagttctaac 
catatttctc 
ccctaaataa 
tccataatta 
aagggctttg 
gacacaacaa 
ttggtgatag 
tagtccacac 
tcaggtctaa 
tagagaaatt 
aaatattagc 
ttcagtgcct 
acatgtctga 
gtctgttttt 
gtgattatta 
ccttttccaa 
taaaggatca 
aggagaataa 
gcattcaatg 
gaaaaagaga 
tctgtattgg 



45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
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gcttacattt atcttttcag tggatctatt taatagaaat agaaataaaa cacatatcaa 49500 
taattgcacc aagatactga aggtagtttt tattgaaaac ttgatcacta aggaattgac 49560 
agaattgaga agtgagaaaa tttcaaagat acacatgcct gaaagcagaa atcatatctt 49620 
atatgtcaaa catgaaaatc aaagtcagct gcttgaaatg tgggcctaat aaatcttgaa 4 9680 
tctctgcaac tgtagggaat agttattgtt ttctaatcta gcacacattt tagtaaactt 49740 
ttctgtcctt aatgttaaac ttcatatttc ctacacatct tttttaatgt taattatttt 49800 
aatatgcatg catatattta ccacttatta catgggaaga aacaattctt aagtctgaga 49860 
aggttggtaa tcatttacat ttctgacaca aaactgaagg catggttgag aattagaata 49920 
attaaaatag cactggggga caacctgaaa actgtgactc cattctgata gaaggagtca 49980 
ttgtccttga acaatttgca aatatgttcc cttcttttcc caaatatgtt cccttctttt 50040 
cccataacag gcttccatgc tttatgcaac taatgcgtta tgaattgcag ctctgtttct 50100 
tacagagaag tgaatttaca aaagacatac ctttcacatc atgttcacct ttttcacctc 50160 
aatgacaccc tgctcccttc tcttaccttc cctgacacca cacagttcac catcagctct 50220 
tcctccttct caacacccta acttctaacc tgcaactgaa gctcacagtt gtgcttagta 50280 
tctgcgtgac agttcccttc acttgacaag ttttaaaagt caactttctt tgtcttacag 50340 
tttattgtct tcagtctttt atttccctta ccttttaata aaaggattca gtattccatt 50400 
tctattaatc ttttattaaa gtttattaga aatagaatgc tgaatagaaa ataggaagtt 50460 
tagctgtagt taagattgtg attattcatt atactgatat ttttcttata ttgtttttga 50520 
actgatttat gggtagttag ctccaaaact gaaggaatac tatgatattt tcaacactat 50580 
taccagcact atgtcaggta ttaagtttaa tccatgagca gagtgcaatt tcaagatcat 50640 
gaattttcac aaacttcaat gttactactg tgtttatgct gaataaatgg tttctttggc 50700 
acctctttca cacacacaca cgcacacatc tatacatggt aacataggct aaagcaatta 50760 
gagaagttca ggtatttcaa tgactgaatc acaaaaatat gttattgatc tcatttgtta 50820 
atcaatgaat ggttaataag aagattaaga agacaaatta caaggtcaca gcccattcag 50880 
ctgaagcatg ttgttatggc cagaggctca tgagaggaaa aggtgtggct aagaacagaa 50940 
gatccgttaa gctccaagcc tgttcaatta tttaagtcac tgagacaaag tagatgtgga 51000 
gtgtgagacg ccagtgtaga gccttcaact gattttcctt ggggaggttg acctgactta 51060 
atagtgtggg aaatcaggtg aattttggct gtgactagat ttgaccaaga tggagttttc 51120 
catgaaaatg caaaccaatc tttagaattt gaattccata aagcaaaaaa ttcttacata 51180 
aatgcttata attttaccac tctaccactg tgggatgcat gccacgagat tttgtgagta 51240 
gttctaaaaa gtatgtcttc ttggacaaca tccaaagtag accacatgca cactgaaaca 51300 
agccttacac aatagctttt ctatatatgt atctatatct acatctatat ttatatctgt 51360 
atctagctaa aagaaaagct ccacatactg atagaattat ataaaagcaa atactaacgt 51420 
ttgggaattt cgattataat acacttaatt gaaaacaaaa caaaaatatt ttttaaagta 51480 
ttaagcaagc atagagagac attttggtgc ttacaaggat ttctgggaat acaagcaatt 51540 
attttttgtc agggaccata tatttaggta cagatcatta acaatgcttt ctttaatatt 51600 
tatttagtga attgtacatg ttaactctaa gggcaattaa aacccataat tataatatga 51660 
gaaacttttt ctccacttag tattttttat tataaatatg gctagcattg aatcattggc 51720 
tataaccatg taattttaat gcttgggaaa catctgttag aaaaactttc cctatagaag 51780 
acatttatca atgtcctgcc ttatctctgg ttatctttgg tttattacat tttatgataa 51840 
ccataaaact gaaaccatcc tcatttacat ttttcctctt ctaaatttgg ggaaattata 51900 
ttttaagtgt ttatcattat ctcattatat aatctacaga aataataaac agcagcatac 51960 
ctgattttaa aatgagactt ttgaatttct acaataaata gaatataacg aactctgctt 52020 
ctttgggtat atcacgtcaa atatacatta atgagcagat agctttaaca tgtttatcat 52080 
gttataatct gattgttgtc tttactaatc aagataccta atttaactga agtatcaagt 52140 
gatctcaaaa cctcagtagc atctgataaa ggttgctgtc tcacctgtgc cacatggcca 52200 
ttgccatttg actgctgctt cacccacatc acctgcactc ccaaactcag cctggctgga 52260 
tatcttttta caagatacct gataatgaag gagagaagga acatggtgaa cactatgcag 52320 
tctcagaaga tctgcttgga agagacatgt tacttctgct gtcatttcct tggccagagc 52380 
aagtgaaatg ctgaacctga tcagtggctt attgtttgaa cagtaataca acctactgtt 52440 
ctatcataga atgatattgg agtcatatat ttcaattagc cattatcaat gtctctcata 52500 
attttcttta cacattttct cagaatctag agtaaaattt aaaaaaggaa agacagataa 52560 
atgctctcat ctccaaattc agtgtcctaa attccactta ctttaatcag tataaaaatc 52620 
aggtagctca gattttaatg gactagtcac ttttgtctct ttttttttgc tatcagcgtt 52680 
tatattgatt atttggaaga aaaaagagcc ccctcaacct tccagagcgg acacacacac 52740 
gtacacacac acatcggcaa cattagagag gttgacctga aagtgtttta aaaactatga 52800 
cacagttaaa ctatgatttc aaagtttttc caagcaagtt atatgtcaga atatgagtag 52860 
gggaaaacta ttgaactgcc acaattctgc tggtggtttc attggcaatt ccaattatgt 52920 
tctgatcata gtgtcatcag agcaattagt tcatcccaaa cttgttttgt attgccactc 52980 
cacagggaaa atagtaaatc tgtcctactg actgcagttc tctaaaatgt ccttaaagtt 53040 
tagttatttc aaacttagtg gaagaaatat gttgttggta gtggtcatcc ttgctagtac 53100 
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tctttaatca 
agaattattc 
ttttttttgt 
tgtatttttt 
ataacatcat 
tttttagact 
atgtatgata 
gatcattatt 
cagtagtgaa 
aattacctaa 
ctctccttcc 
cccagaacac 
attacatgca 
ctttgaaggt 
tcagagacaa 
caaacaaaaa 
ggcaaaatac 
taaatacata 
ctttcaaaac 
ggctgctggc 
tagaagtgta 
aaaggtttgt 
attccattgt 
ggccttacat 
ggttcaactt 
tagtgtcttt 
agtattttat 
ttattcactt 
cataagagtt 
cctttattcc 
gtactacgtt 
gaaaaacttt 
tgttttaatt 
tgattacatg 
cactgtttaa 
ttttcgtatg 
aattccttta 
taatgttcat 
ttgggtatca 
atattttgga 
gcaaaaaagc 
tctcctgact 
ggctttgcgt 
attactttcg 
ctatgttcct 
taaaggtttt 
cctattattt 
gctagtttcg 
gatgatttaa 
ttcattatgt 
tttctaattt 
ccacaaattt 
ttgatcagaa 
tgacctaata 
ccatagaaac 
cattcaagtt 
gtggggtatg 
cactttttgc 
atatcttctt 
aatctacttt 
aaaatatatt 



ttacctgcca 
tactaccgaa 
aattctaact 
ccatttttta 
ttccagagtc 
tcctaaaatg 
cggtttcctc 
ttttaaagaa 
ttcagaatgc 
tttgtaattg 
ctccccggag 
ttagcaattt 
ataacgatac 
ttagatacga 
gtcagagaga 
acagatttta 
acatgacatt 
tattacattg 
tgaaattcta 
aaacagttct 
atcatacaga 
ccatgttgta 
atatataaat 
ttgggtcttt 
cattccgttg 
ttcccattaa 
tctattggcc 
attagttctg 
gtctacatat 
aacatatgtt 
aaatagaagt 
ttaaaaaatt 
atgaaaggtt 
tcttttttct 
tttttgtgtg 
ttgaaccatc 
atatgttgca 
acggaatatt 
aggtagttct 
aattattgaa 
catcaggcca 
cttaaaatgt 
ttctagaaat 
taatactctc 
tttttttttt 
ctaaatttgt 
tacattctct 
ggttcagtat 
gatctttctt 
cccatatgtt 
ctcatgtaat 
gtaaaagttc 
aaatgttttg 
tatgttctat 
tgggtagagt 
ctctgttgcc 



ggaaacactt 
agctcaagat 
tttcaggtac 
tggaattgta 
atcaattcct 
tcttgataac 
tatataatgt 
actcaggaac 
atagcagagt 
cagctattca 
ctcctccttc 
ctgctccgat 
attcaatgtg 
ttgactgtgt 
gagagagaga 
agaactgagc 
tactatctta 
tgtagacatc 
cccattaaac 
acttattgtc 
atttaccttt 
gtctatgtca 
cacattatat 
gattcatatt 
catgtggaga 
atgatcttgc 



ataactacaa 
ttcatatatt 
gctacattaa 
gtctgatatt 
ttttcatcct 



aacatgtttg 
aattcattta 
atgattatat 
ctctcttttt 
gaggaaagca 
gtgatgaggt 
attgaatttt 
tcattagtgt 
ttgaaccatc 
tttgtattcc 
aaattcagtt 
gatctttagt 
ggcctcatag 
aattggggtg 
aggacttttg 
ctattacaat 
ttgtccactt 
tgctgatctt 
tttattttgg 
tactattttg 
attttatctc 
attcttattt 
gttttttgta 
ttgacatatt 
ttattatttg 
tcgtttttct 
tatgatattt 
ccaggaaaat 
gttccgtatc 
ttacttatat 
caattattat 
ttatggtctg 
accttttaat 
agtatagcca 
ttaactttca 



tttatttacc 
gtatttgtgc 
ttttctcttt 
aggcatattt 
ttgtcaggac 
caaaacacag 
gtcacataga 
aagaaataag 
gtgcaaagta 
ttgcacttga 
ccagcgtttg 
ttgctctagc 
aaactacaat 
caaccataag 
gattgacaaa 
cctaattcat 
accattttta 
accatcctcc 
aattcctcat 
tgtatgattt 
ttgtgatggg 
gacttggctt 
atgttatatt 
cagttaattt 
ttcagttttc 
cacccttgtc 
ttttacgcca 
ttagtttcat 
catctatgga 
cttgcctaat 
gacacccttg 
agtttccttc 
ctcaagtgct 
ggtgtattcc 
tttgtattcc 
aggcataaat 
tgctagtact 
tttctatttt 
aatgtgttag 
agcccttctt 
tttgtcagga 
tttctatttg 
catccaggtt 
tgtatttatg 
taacttgagt 
aagaaccaac 
agctgtaata 
ttttagttcc 
atgtatgcat 
gtttgttcat 
atccattggt 
tctgttactg 
tttaccgtaa 
gttccatgtg 
tgtccattag 
tctgtctcat 
agaactgtat 
ttattagatg 
aatatataat 
ctcttgctct 
gtgtatctgt 



agtgcatgag 
tatagttata 
agctgtttta 
ggtggttcta 
atgacattat 
agggttatca 
gtattttatt 
gatgcactca 
aagcaaatgc 
ttctttcctt 
tatggcagct 
ttcaaaggtc 
ctgtgtaact 
cagaataata 
caaacaaaca 
ttttttttct 
aatgaacagt 
atcttcaggg 
tccccccctt 
tgatcactgt 
cttatttcac 
cccttttaag 
ctaagcattt 
ttatatggag 
tcagcatctg 
aaaaataatt 
gctgatataa 
atgggatttt 
cagagataag 
tgctctagct 
ctttgttcct 
tatcctagta 
tttactgcat 
attgttaatg 
aggcataaat 
cctacttaga 
tttttgaaga 
ttgtgatgtc 
gaagtgttcc 
taaatgtttg 
tatttttgat 
tttgtggttt 
attcaatttg 
tagacttgga 
ctttcttctt 
ttttggtttc 
tttattattt 
ttaagttgta 
ctaaattttc 
tttcgtttat 
aagagtatgt 
atttcttact 
gttttttgag 
tacttaaaat 
acctagtcgg 
tgttctattc 
gtttctgttt 
cataaatatt 
gtccttcttg 
tcttcactta 
gtctttgggt 



agtttaggtg 
aatatagatt 
ttttagtata 
gtttacaact 
gtttttccat 
ttatacagtt 
tccagccaaa 
cacaaataga 
atccatgccc 
caactccttc 
tcgtaataaa 
tatggtcatg 
ttgtatttct 
attcctgtgc 
aacaaacaaa 
tttttattgt 
ttagtgatat 
ttctttccac 
cccccagctc 
aagtacctca 
ttagcatcct 
gctgaatgat 
tatattttta 
gttcatataa 
ttgttaaaaa 
tggccatgta 
ccatatgtaa 
ttggtggaat 
tttacttctt 
aggatttcca 
gatctgacag 
tactgagtat 
caataaaaga 
tggtatattc 
cctactttga 
aatggtgtat 



ttgttcccat 
tttgtctgac 
ctcttctttg 
gtagaattca 
tagtaatcta 
agatttagta 
ttggcatatc 
agtaatgttc 
tctttctagc 
atttattttc 
catttttttt 
aagttaggtt 
taacagtgct 
ctccaagtat 
tgtttaattt 
ttatcccatt 
acttaatttg 
aatgtgtatg 
tgtattgtgc 
attattgaga 
tcaattctgt 
tataattttt 
tatgttttaa 
ctatttggac 
ctaaagtgag 



53160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
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tctattgtag actgggccta catcattttt taaaatccat tttactaacc tctgtcttct 56820 
gattggagag cttaatccat ttacacttaa aatcattgct gataagcaag caattttgct 56880 
atttgttttt tatatgcctt atagctttct catacctcat tttctacatt tctgtgtttt 56940 
gtgtttagtt catctagttg atttttcagt taaatattaa aagtcttttt ttctgctatt 57000 
ttctttcttc tatattcttt gttgttacca tggtgattat attcaacatg ctaaagttat 57060 
aacattctaa tttgaatttg tgccaactta attttaataa tacactcaaa tattttttac 57120 
agctctgtcc ccacctcttt cagctgttca tgctataaaa ttatatctta tacatttcat 57180 
gttcgaaaac ataaactaat aattctttga tgcattcatc tctcaaatta tgtagaaaac 57240 
aaaatgtgga attgcaaacc aaaattacaa taccaacttt tagcataata attatcttaa 57300 
tgttttagtg tcttaaatga agtagaaaac aaaatgaagt tatacagcgt gttacaataa 57360 
tactagcttt taaaattgcc tatttttaat tcagatttaa tttttgtata gcttttagtt 57420 
attgtctagg gtcctttaat ttcatcccaa ataactcccc tgagcatttc ttgcagggaa 57480 
ggtctagagg taataaactc ccccagcttt tgtttacctg gaatgcctta atttctccct 57540 
cattttagat aacagttttg ctgaatctag tattgttggt tgacagtatt ttacacttag 57600 
tattttgtat gccttctgat tttcacttga aaactgagca tttgaatcta ataatatgtt 57660 
aactctgata accggattct ctttctttct gagggtttgc tgttttgttc ttgacttttt 57720 
tttattgttg cagctgtctc tgtgccaagt attagcatga ggtgtaaact tattctctcc 57780 
tcaggttttt ttctgaactt gtatcttccc actgtgtcta catggtgact tcctaatttc 57840 
acccatatat acagttgctt gtgaatgtct ttatcttcaa tgtcagccgt ccaaaagagt 57900 
aaaaagagaa aaacaaaggg agaaaaaatg aggcaccacc ctctttacat tccctgggaa 57960 
ccacctcagc caggggagag gggctgacaa taattgagga ggagcataac aatggccact 58020 
gtttctttgt ctgcacctct gggatcagaa gcagtgatca gtgatcagag cacgggtctc 58080 
tgttatctgg caggcagggt tgtttgttgc ccaccctagt ttctgcaagc tgtgtggtag 58140 
ctgctccagg caccgctgcc tgccacagag ttgagaggtg gggaatgaat aggtgctatt 58200 
gtgctaatag ctaaaattga ccacaattaa ccacaactga cagtctaagc catctcctgg 58260 
gagtttgaag tgcaagtctt tccctggaac ttaatttcaa tagtttccag aattccaaaa 58320 
. tagtaacatc agacagattc tgtcagggca ttattttttt ctatgtaggg atacatattt 58380 
ctagtgctac ctactccacc atctttctag aatattctcc tccccaataa tttatatcat 58440 
catttctggg gttgtaagtc aacaatctgt attttaacaa tttctctagg tgtctctttg 58500 
cctgctttaa gttttcaatc cagttagccc tctctatcct caatataaaa caaaagcacc 58560 
atgaaaacat ttggctctaa gtgtggttgc catttaattt atgttccaaa ccagaaaaaa 58620 
aaaaattgag agtaaaagaa cgtgctatta ataattcatc tgtaataata tatatgaacc 58680 
tggactgtct gaaacagttt tttgagagac acttagaaaa aattatgctt tggaaatttt 58740 
ttatggggtg tcattaattt gtttttgtcc aatttttgca acaaaatttt gacttctata 58800 
gtgaggtgac attcataaat ctgagtattc atatctaaaa acccaaaagc atttttttct 58860 
aaggatgtca tgtagatcat ataaaataac tatagaaata ttgaatccat aatatgtcat 58920 
atgagttgca gccattttaa tattttgtcc atttttatat attctatgca tataatatat 58980 
ggtagttttc tgatagtaat ttccaattcc tctcccaaaa tggactggac agatttctta 59040 
gtgagggaag actgttttta tatcattcaa attttatcag tccttttctg tgggtcccct 59100 
gctgttctta ttccatagca tcttatgtag gattgatctc ttccttgtgt tagagtcaat 59160 
attattattt tcagaagtct tggtttttcc attggcaaat gttcaaataa aactttataa 59220 
ccaaatcttt ataatgctgt tgccaacttt gtgtctttaa ataaagtcca attgccatag 59280 
ggcaattgtg cagtgggatg gatatgtgaa tctgtgtgaa aaatgatatt ttaaatgctt 59340 
tgtggggaag agtaaccagt ctggattgaa ttgaatgcta aattggcttt acatttggaa 59400 
aaaatttaga tgttatagta gttcttatat aagaacatta aacatctcac tctaaatgaa 59460 
gtacctcagg tgatactgtg atagctataa ttgtttatga cggaaatatt cactcttaga 59520 
agacttgggt attgacaaga acttggaaaa ggatcaagca gcaacatata tgaaactgtg 59580 
cttcagcagg ggaaatacag ttatgttcct gggtcactgg actcttgtta tggactgaac 59640 
aatccaagct atataaagag acataataac tatgatcaga actttgtgaa atatgaccgt 59700 
tctatttgaa ttcaggttct aagcagtatt ccccaaagag atctgatatt gaatcaatgt 59760 
tgtagctacc agccttgaag ttacctgaac tttggtattc tattattccc ttagcttgta 59820 
gaaactgact atggtgaatt cttaccccaa atgtattttt ccctagtaga tgaactataa 59880 
tctttcttat tcatagtcta tgattggtgc atcttctcta gctccctttt ttagatgtgc 59940 
tgaggaagtc cgctgtggtc ctaaagctgc acctgtgtgg ctgttgtcta atgtcccgtt 60000 
ttggtgctca atgtcatgag tactaagatg attctgagta agagtcctaa ttaatgccac 60060 
caaaatttgg cgaacccttg tcttttgggg aagaggggca gaattcttca cgatatgcct 60120 
caaacctggc ttcttcactt ctgtgtacga gataatttca aactgctttt ctcactttta 60180 
aattttattc atagatgtaa aattttggca ttttggctaa tttaatttca aatttgtatt 60240 
tgcttgttcc tgctttgatg cgctacttat gttgtgtttg aaacatagta gaaaagaggc 60300 
tgcgtcggac aagtatgaga agacatgctt acagtcacat aaaaggagaa taaaaatatg 60360 
tgtgtgactg tagaccgtac ctccatgaac attacaaatt tctgactgcc atctagaaca 60420 
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cgtgctttat caacatgata tcaatgatgt ggatatttta cttactttct attttgttat 60480 
acatgtttgg tttagatgtg acctttgacc atttctgtat aattttaata aaatagattt 60540 
aactacacaa aaatcaccct aaaagtaatt ttccatgatg cattttaaag gtcatgtttt 60600 
ccctgatgac agcatgtttt tgtagttaga ccttgggaaa gacaacaagg agatctgaga 60660 
gctaatgcaa aattatcttt actagctgga tctccttgga ctggtcattt atcctctgaa 60720 
tcctaatttg ccccttagca ctcacaaatg tcatctcccc aaagccatag ctgagcagaa 60780 
agtggagctt agcatttttt attctgatag aagagactgt gtccatgcaa ttatgacagt 60840 
aaaaataaat gagcatagct tcaaacattt ctttaataga aacttagcct ttgggtttaa 60900 
ggttatataa agtgaaagga atttgggtaa ttaatatgaa gtcaaacatt aaaagattcc 60960 

tcaagtaatc caaaggtatc aaatgttcaa tgacataaaa tacctctcta gtccttatga 61020 

tgagaagaga aggtcctgat gagtggctaa gtagtgatga aagtctaagg ctaaatttct 61080 

atgaagaaat attgtaataa aatttttgaa agatactcag agatatttct tgagatcatt 61140 

atcatgtatt ctgtacacaa atgtattctt tccattttat tttcagtgat tttccatgta 61200 

cagctataac tttagaaagg ctgattgtta ttctttggct tttctccctc aattaaatca 61260 

tgtttttttg gtggaaagag agatttaaaa taaaatggaa gcaagtggta ttaattcaga 61320 

gttactaaaa ctgaaaatta gggaaatgta gaaaatataa actaatatca agctaattct 61380 

tttctgtatc agtgttagac cttttatggg tgaatgaggt taaatgtgtg ttagtataaa 61440 

ggatgtaaag gagtaagcaa agacactaag accagagcat gactttatat ctatatcagc 61500 

ttgagtctat ttatagtaga atcactagaa agtggtgacc acacagagtt gcattttttt 61560 

cttttgatat gcatacattt tccctgaaag aaaaaaggaa ggaaggaagg aaggaaggaa 61620 

ggaaggaagg aaggaaggaa ggaaggaagg aaagatggaa ggaaggaagg aaggaatgaa 61680 

ggaaggaagg aaggaaggaa agaaggaagg aaagattgag gtaaggaaga aaggaaggaa 61740 

gggagggaga aagggaagaa ggaagaaaga aggaaggaag gaaggaaaga aggaaggaaa 61800 

gatggaagga aggaaggaag gaatgaagga aggaaggaag gaaggaaaga aggaaggaat 61860 

gaaggaagga aggaaggaag gaaagatgga aggaaggaag gagggagcaa ggaaggaggg 61920 

agggaggaag ggagggaggg aagaaggagg gagggaggaa ggagggaggg aggaaggaag 61980 

aatatttttt aaaaagaaac tctttgatga ttcaacatca tggagaagtt attattatat 62040 

tgagcttgac ctggatctgt aactcttgac atttagctct tgaaaataca tctgttttct 62100 

tggctgatag tcttaaaaga cagaatccaa cataataagc ctgcacatgt atccccatat 62160 

ctaaaatgaa tgttgaaatt ataaaataaa atacaatata tgttcacaaa tattaaaaaa 62220 

atgtgcagag atatgggaat acatatgtgt cttatgtgtg cttatttcta tgaagtcgtt 62280 

agatgtcaaa gctctttctg catgcatcct tgcttagacg ttttgtctga aattctgttc 62340 

acattgacat acgtgcatta aactaatatg agaatgtatg cattaaacta gcttgaatga 62400 

tgtccacatt ctcagttttt tcatcataac tttttctttc cagagtgcat gatgccgcag 62460 

aaaagaaaag ctaactgtga aactcaaaga aagtctttgt tagggcctga gtattttttt 62520 

tttaatctct atgactttca cattcttagc ttcagtaatc cttgaagaga acagagtcaa 62580 

accttgtatt ttaattagga tctttcagtg tttgtggcct tgaggagcga gagcaagaaa 62640 

gacattctat gggtgtgcgt gaaagagagg accaaggttg ttttctttgt gactagagat 62700 

tatattgtgt aagagcagat gacaaccttg agattcgatt tgtgatatga aaccaaatgc 62760 

actaaaagga aagcgtacca gtttggatta gatacattaa atcagtatgt catgtaaatc 62820 

agtgactatt aaaagagaat atataggaga aaaaggtatt agaagttgca ttttttaacc 62880 

aattgcaacg tttgcaaatg ggcttgtcac ataccatcat gtaagtacta gtgaagtcta 62940 

aggtataagc aagtattttt catttttaag gatcattgca aagatagaga atttgacaat 63000 

aaatgaactg actggccttc tacaagtatt tacttacaac cacaattgta atttttccat 63060 

ggccaagcta tgaattatta aaagaatatt gacattttgc tatattttta aattatcttt 63120 

aaatcatttc aaatagtgaa atagtattat actaattaaa ctttattaag tacattttca 63180 

agctcagcta agttttttaa tcaatgttta tttcagttag gggatagttt tgtttttttt 63240 

cttctttgaa ataatatctt tgaaaccaat gtccacattt ctttacagat aaaagagttc 63300 

tgtttatgaa ataagaataa acagcctgta attttaatat caccaaaact attttaatct 63360 

ctttttggat atgtacttca acttaaacta catatatagc tatagatata tagatatgtc 63420 

tatatctaga tgtagacaca tcccccaaaa cctatagaaa tgtatttctc aaatgtgtaa 63480 

agtatagtga aactgccaga ccacaagcga tgttttgtat ttatagatct acttgcaatt 63540 

tggggtgtgc caaatcccca caccttgttg tcaatattag tttaacaaaa ctaaaaggaa 63600 

atttaaaata ttaagatcat cctctttagg atctacttgg agcgctgaaa aaggtgtgaa 63660 

aatactgatc tcggacttct tgtgagtttg atgtgtttta catttgtaat gctattctct 63720 

tcactgcatt catgcgtgtg cttgaagctt gctgccttgt aggagacaat gcaacaactc 63780 

ttaccaatgc cacttttata agattttgcc tgcagtgtgt caatgtgact cttcagtacg 63840 

ctacttaagg gagtgcaggc aagatgacct accctgagtc atccaaagaa tatttctcta 63900 

gttcctgttg gtcctgtgaa ttcctaatgt ctttataata aatggccacc taaaaattaa 63960 

tgtaactggg ttgattgtgt tgcatgacat taagtaaacc taactgccgc aacagggtgc 64020 

agtctgcatc atagcaaata cgttacggga gagaggggca cgtcctccct cagcgcccac 64080 
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aacagggatc attaaccaag gctgacacaa ttctccagtg agaacagatg taaccttgga 64140 
atctttctcc acacagtgct ccaggaagcg attcgacatg agtctgtact ggcataggaa 64200 
aataaaactg gcatggctgc cacatagatc ctattttctt ttgcctctta ccaagagtta 64260 
ttttaaaaat gaaactattg tcggccaggc gtggtgactc acgcctgtaa tcccagcact 64320 
ttgagaggcc aaggcaggca gatcacgagg tcaggagatc gagaccatcc tggctaacac 64380 
agtgaaaccc cgtctctagt aaaaatacaa aaaaattagc tgggcgtggt ggcgggcacc 64440 
tgtagtccca gctacttggg aggctgaggc aggagaatgg cgtgaacctg ggaggtggag 64 500 
gttgcaatga gcagatatca tgccactcta ttccagcttg ggcgaaaaag tgagactgtg 64560 
tctcaaaaaa aaaaaaaaaa agaaactatt gtcaaccaat gagtgtaaag gtgttatttt 64620 
tgcttttttt taagtttatg ttttgaaatc ctctctcaat tgcaggaaaa aaaataagat 64680 
acagaatata aagagacagc ataaatgata gaaaacaaaa taagcatgtt catgagccag 64740 
aaatcaagaa gaaaaggcaa tgataaaaag ggctgaagcc ggaggcatcc caggccctgg 64800 
gtctggaagc aggtactcac agtggcattc ttgggcactc tggggcaccg aggtctgtaa 64860 
gtaccactca taacgtgact ggccagcaaa atttactgct taacataaac aatgtcttgg 64 920 
cccccacctt gcaaaataat gctgaaataa aaggtgttac tgatcattgt ctgggactaa 64980 
ggtttagtga ggtttactta cagaaaataa aaaagcaaac agtcatacac ccaagattgg 65040 
gccaagctgc ctgttggatt tgacatatcc tgcaatcaca gggaaaaggt tggtactata 65100 
ctgagatgtg gctcaggatg ggagaccaca gggcaggcag gaaggaggca aaagcaatct 65160 
agaagacctg tgtgactagt gagaaaagca agagtaagag aaaggaagaa agagagaaga 65220 
tggatgtcct cttgttaaaa taaatgtact aaaacaaata gaacaatatg aaaagaccct 65280 
acataggtta agtatcttcc aaaaaagaca cagcattcat gtgagcttca tattgctact 65340 
atagcaaatt gccacaaact tagtagctta acacaacatg cattcattac tttacagcag 65400 
cttagacatc caacatgggg cccactgggc tcaaatccag gtgttggcag agctgctttt 654 60 
ctttctgaga ctctagagaa aaatygtttg ctttggtttt tcagctctgg aggctgctgc 65520 
gttccttggc tcttggctgc ctctccatct tcaaagtgca tcactgcagc ctgcgcttca 65580 
tccatcacgt tggctctgtc gctcacattt ctacttctct cttgcaagga cccttgggat 65640 
tctattggac tcacctggat aatccaagat aatcttcatc ttaaaatcct taatttagtc 65700 
acacacagaa aatcacctct gttatgtaaa gtaacatatt tccatatcct ggaggaaggg 65760 
ggcattgatg tgtctaccac agcatccaca aagggaacaa aaaatattgt ccaataaatg 65820 
catatagaaa taaatttgtg gattcaggaa aaagtggagt catggaacaa aaatatttaa 65880 
tagatagaag agtttccaga ctagacatcc aggaaagaat tcacccagga tgcagcttag 65940 
agaaaatgcc aaggaatatg cgaaggaatt gcatgcatgt acatgcgcgc gcgcgcacac 66000 
acacacacac acacacacac acacacacca catagaaaag aacaattcac agataggagg 66060 
ggtagattta gagactccga catataacca tgaggtttac aataaaaaga ataaaggaga 66120 
aacaatactg aaaaaagtaa ggaattttcc acaattgaag aaaaacatga gccttcagat 66180 
aggatgtgcg .tgtctcgtac tgagaaaaat aaatgaaaat aaatccgcac agaggcacaa 66240 
agtaatgaaa ttgtagcacg tcaaaatcca tattctcttt gggtttgaca ggaggaataa 66300 
attcaagaga tgtattttac aatagggtaa ctacagttaa taatatttat tatactcttg 66360 
aaagttgctg agtagatttt aaatgctctt actataaaaa tgataagtat attaaataat 66420 
gcatatatta attagcttga tgtagccatt ctacagtata taagtatttc aaaacaacgt 66480 
tttgtacatg ataaatacat agtcaaaata atttaagaat ttttaaatga taaaaaatac 66540 
attttaaaaa ttggaaggcc atattcttaa aaactaaaca tgagaaaagt attaccaaca 66600 
aaaaataaca attaggcctc tctagaaacc taactcttaa aaacagacat tacagaatac 66660 
tttcaaaatg ctttggcaaa aacaaaaaac caaataacag ggaaagagct ctatgaatta 66720 
aaaattttgc actcagataa agcattaatc aagatgaaat acaaaatgaa gtacatggtt 66780 
tcttacagct cagttttcct atctggatta actttagttt tgcaatagta tttcttgcag 66840 
taacgttcta ttatttgtgt gtccaaacat gcatatttat ccagcaacta attcaaggta 66900 
tgagaggaat gaagatactg atgacagaaa ttagagcagg gaatcagaag agagaagtgc 66960 
atgttccctt gcatggcact cttcagttga caaaaggaca tgtgatccca agatttctct 67020 
tgactgagac tacatttaca caaattaagg agttgtgtaa cattaaataa ccaaagtcat 67080 
agccttagga acttccattg caaaaaagga gtttaacatg gagctgttgt taaagggagg 67140 
gtcagatggt cgaaaaatta attggcaaac aagttgattg cataagctgg aagtactgtt 67200 
gaaaactttg tggttcttta ttaaacgaaa acacttcgga tattgggaag acagcaacag 67260 
tgaaatttcc ttggaatcag gaataagggc ctttatatat tctactcact gtctacatgg 67320 
tcatgatctc aagttattgt ttctcataca tatcagagat gtgctaaaat ttgaaatagg 67380 
ttttcaatca ggagccaatt ctaatcacac atcagtggca acttggagcc ttgagtttgg 67440 
aaagattttc tttagtgtgt taggttgttc ttgtgttgct ataaagcaat atctgagact 67500 
gggtaatttg taaaaagaga ggtttaattg gctcacagtt cttcaggccg tacaggtagc 67560 
atagcaccga catctgctcc tggggagtct tgaaaaagaa ggcaaagggt cagcaggtgt 67620 
ctcacatggc aggagcagga gcaagagtag ggagaagtgc cacactttta aacaatcaga 67680 
tcccttaaga actcactcac tattgcaagg acagcaccaa ggggatgatg ttacaacact 67740 
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catgagaaat ctatcaccat ggtctaatca cctcccacca agtctcacct ccaacattgg 67800 
ggattacatg tcagtgtgag actcaaagaa gacacatatc tgaactatat catgtacaat 67860 
tccaagacaa agtctactaa agtctgagat caattaattg tagatatctg tcattaggta 67920 
actgaaatag aaggaatggc cacagaggtc ttgtaggttt gtggcttgtg caggtgtctg 67980 
ctgataaatg tttaacaata ggttttcata ggtggtcagg gaaggcctta tttgtagcat 68040 
ttgccaatct ccctgatata aatattccaa ccatggatga tttactcaca agatggaact 68100 
gaattctgac cacagctgtc tcttgctagc tggtgtgagt cagctccaga acctcaggag 68160 
tctggtggct taatgctgaa acttagtatt taaagttgtt atcttttgtt cacctgtacg 68220 
gaagtaaagt atgggtgtga gaaaagtttc agttagttta gttagtattt ataagaaaga 68280 
gtttaaaaga aaagaatgga aatggctcgt gggtcgcata attgagacta tgaagagtgg 68340 
ttgaataaat tggtgtgttt tatcctggag aggagggagc tggagagtga tttaataatc 68400 
ttcatattta tggaggtttt ttttttttga cagaagatgg ttcccagtta ctttccatcc 68460 
ctaagtttca aatcaagata aatctagcct taagtattcc cctttcctat ccaataatta 68520 
tcgctttaaa aatattttag aaaaaaatgt actgtataaa gaatgctttc ttagattggt 68580 
tatcagttga tccagtttct acactttaaa tcctgctggc cttttacact atatgtaaga 68640 
ttaggcaata aagataggga aggttcaagg gaaaacttct gaggccaagt aatccaatag 68700 
gaatccaata gcaatatggg aaggcaggaa ctattttacc caataactct agctagaaat 68760 
ttcaacaatc accaaccatt cctgctcccc cacatgtcat tcatcacact gcttcatgaa 68820 
ttctaactcc agaagtcctc ttgaattttt cccagcctgt cctgtcaaat cgctggtcta 68880 
gtgcaagcta ctctttataa tattgcaatc tgttcttatt ccactttaaa taaacagagt 68940 
agagaattgt tgaccagatg aataaaaagt tggccaccaa tttgacatta tccccctaac 69000 
taacattatc ttaaagtatg cttctcagga gggcatttta agtcatttga agttctaaat 69060 
gcaagactga atatcttctg cacctgtttt cattggttca atatgaaaga ttatactatt 69120 
tccaccatgt taccaaatat gattggcatt gtgttaaatg ttcttaggtt taatcactct 69180 
gaatcctttg cctaaaatgt agattttttt ttttgctata aaaaagacat tagagtctat 69240 
gccaaaattt gcaattcttc tataaaatta ttttttgtct aaatcttaaa aataacttaa 69300 
aatcactaaa tgttcaggga aatctctagt gagaaaaata aaatttatag ctcacagact 69360 
agattgaaca gacaaccatc tgttcaactt aaaacatttg tctcaaataa cataataact 69420 
cacaaatgat gatggattag gtaacttcat ttcatattta ttaaattatt aacccaactc 69480 
tttttaaagc catcaatttg catgaagctt atgagtgaga gatatacaaa aaaaacttaa 69540 
tatttgatga tctcctaata tgtgctaggt gctttatatg tatgacattt taacaaatat 69600 
ttaagtcaga tactatctct tttttcagat aaagaaattg agacttaagg taaataattt 69660 
acttacagtc acaaaaatga agtacgctta tacaatctga atctactgat ggcttcaaaa 69720 
tccttttcta cgatgaaatg tagcttcatg taggggagta taacatggaa ataaggcagg 69780 
tagtttttcc tttccaaatt tttagtatag atgaagaaaa agacattctc aattatcaat 69840 
actgtttaaa aacaatgtct ttgatttaat atatggtaaa tattaaatat gtggtcgtga 69900 
aaataagtat atttttagag ttcaaagggg aggaaaatat cctttgtaag ctggggcatt 69960 
ttcaatattc aattttattt attttttata atgaaatagt cagcactcca accatccatc 70020 
tatccaccaa ttcatccatt tatctacctg tccatctatt catccatcca aacatctatg 70080 
tatccatcca ttcacacatc cattatttca tcctgcattc atacatccat aactactctc 70140 
taagaaagaa gaaaaattta ttcaggcttc tatgtagatg gaagtgtttt gatactaaaa 70200 
tgtggatgtt tgtaatttta ttattgccat ttcctttcca aatgtatgcc aatcatacat 70260 
ttttctaaga cttatacatt acctctgaaa tgcttcctct tggaaatgat gcacgtcact 70320 
tctactgaca ttttattggt caaagcacat cacatagtca tgcctgagcc tatcctcagg 70380 
aagacaaggc catttgacca ggaaacacag gagatccaga aatattggtg aacactggta 70440 
acgtccatca ggctgaaaaa atgtggtata aatattaata tttaagacac aaagaggtat 70500 
ttaccatgat aaaacatatt taagacaaag gtttttacta agagcaaaaa cataaacttt 70560 
gtgttagtca gttctgtcta ctctaacaga ataccataga ctgggtggct ttaacagcag 70620 
acatttattt ctcagccagt tctgatggct ggaagtctga gatcagggtg acagcttctc 70680 
ttgaggacca tccttctggc tgcacactgc tgacttctcc ttgtatgctt aagtggcaga 70740 
tagcagcgac ttcctggagt gacaagggct tcaatcccat tcgcgagggc tccgcctcat 70800 
gagctaatca cccaaagtct ccacctccta ataccaacac cttgaggatt agaatttcaa 70860 
aatatgaact ttggggcaca caaatattta gtccataaca gccttggagg taaaattcta 70920 
agtagcagaa atctttcatc tcttctaaaa ctaacactag tggaaaacat aatctttcat 70980 
tcaacgctta gagtctgtct tggagagcag gaacagatgt attatattac atgtgcacac 71040 
atacacatat acacacaaac atttgtttgt atgtgtgtgc ttatgtggat atctatatgt 71100 
catatataca ctgcatctag cacaataggt aattgctatg tcagggcaca tttattatag 71160 
ttttctctct aactcataag ggagagtttt ttaatcagct ggtaatatca tatatttcag 71220 
ttttttcttg caaataagag tggaatagaa agattttcgc tgttgagaat tgtgccctta 71280 
agtgttaaca atattcacaa gtgtttaaag aaataattag gttcatttca ttctatcatg 71340 
cttgtactat atttataaca tgacctttgt tctatttata tatacaagta atatattcta 71400 
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ataggaattt ttttgtgaag aagaaacact tgataatatc ccaattccct aacagaatta 714 60 
tacaatagtt cctcaatgac cttaccttgg gaaaaaatta aatgaaagac acctgcatta 71520 
gagatcaatt tcctgatgct cagtttatat tcatcatgct aagctatcca aggacacctt 71580 
tcttgtccta tcatttgatg gaaacgattt ccatgtgttt tatgattcaa ctccattctg 71640 
ccttagacaa tacagtcact tcaaacagtt caactctaca gttggctggc tgactttgcc 71700 
ttgtaaaaat tgcttttttt ttttttggtt cagagtattc tatattttac ctaagaagtg 71760 
gaaaacaaat tttgggcaag aatagtggca tacacattac agagtgattt cacatttcaa 71820 
tattatttac aaatatgaca tcaaacatta gtggttttat ttcattttta tatcagaggc 71880 
attctaaata tgatgtgaca aagttaactc cgaatttcct ggtttttgtc agtttaaccc 71940 
agatctttgc tatgcagttc tgaacactgc catgctttca acttttcaac aagaaacagg 72000 
tggttaatac acacatgtaa ctctacaact tgatcttagc atttcatgta atactttggt 72060 
aatgcctaaa tattatgttt gtagttgtcc ggagcatctt attttcactt tttcaattct 72120 
atcaaagagt acattaatag aaaattaacc aaaactgaat aagaataaaa cctcacttcc 72180 
ctaatttatc ctcttataga aaaatatttt cagtgtttat atttatgtga taccagccgc 72240 
atctgggttc agtaagcacg tgtttcgagc atactttgag atgcattgtt ccttggcgca 72300 
cctcagtttg tactggaatc attaatgcat aatctgaagt tgtgattatt gagatttttc 72360 
actcaataat gggattcaaa tgctgccctg tttgatctgc attaactgtt tggaattcca 72420 
gaatattgac tggcacttca gaagtcacca gagtattctg tagccttcag agggtaaatt 72480 
tcagtcacgt ggggatgatt tttttggcaa agagcatttt gcactctgag tcatcacaca 72540 
cattctcaca agtttttaag cccaggacag tctccattcc accacttgta ggtaaactgg 72600 
tgatgatttt aaaggtaatg tcagcctgag aaacagtgaa atatgctgaa gatatggaca 72660 
caggcacgga gtctgcgtca tcttagtccc taactctcac cttagtgaat tgttccaaca 72720 
atgacataat gaattatatt taacttttcc tgctgctcat gttttgatat ttgtcatcaa 72780 
tcggagctgg tgagagtttt aataaagtgc ttagcatcac ctgagggtaa atttctatat 72840 
cctttggatt ccttgatcct cctctgcact ttatgtaatg agtgaccttt ctatctagtt 72900 
caaatagaga atgtagagaa gtaaagtgct tcatgcaaga gggcatcact gatgtcaaaa 72960 
tgcagggcca acatgaaaga agggcagaaa ggttccggac acaaagggca gggcactgcc 73020 
tggactggct agtgtcgggg tttcagccag aaccttcctt tagaatttgg aaactatgga 73080 
gaaaacacag tgatgcagat caatatatac cagttttgtt acagatgttg aagaaggtgg 73140 
ctcactactg tgatcataca tttgggcttc ccaacttttt ctcttgaatt aaacatatta 73200 
ctaaattggc aggctttttt cctaacaact gtgaaatgct agtctttaga aattttcagc 73260 
atccagcagc aaataacatg catgtccaac ggcgaggatg gctccaaaaa agagagaacc 73320 
ccaaatgaaa atgagtcaag cattcttttt tctttaagca gagcaagatc attcttatca 73380 
gaggaaattg tgagcagggt acaagaagct gcacatgcct aggaggtgag agttctgtct 73440 
actccaggga aaaatgagaa gtgttgaaca agcagcactc ccagaaaata acacattgta 73500 
gtagacgccc gttacggttt ctccatgtat taaattggtt tataacattg tctttttgtt 73560 
tgtctctttt gagtttataa gttcccatca atattttatt tcctttactt attttgggtt 73620 
taatttgtgc ttttttctat ctttaagata aaagcataga tgataactga ttttagactt 73680 
tgcttatttt ctaatatagg catttaaaga tgaaagtttc ctcttacaac tgatttagct 73740 
gtattcttca aattttacct ttcaaaattt tatagggttt atttttatta ttgttcaatt 73800 
tgaaatattt cctaatttcc cttgtcagtt ccacttttac atacagatta tttaaaatta 73860 
tgttgcttcc caaatatttg ggaattttta cagatatatt atactattgt ttgtgatttc 73920 
taatagaata ccattatagt cagagaaaaa tgtcatgtat tatttcaatc cttttaaatt 73980 
tatcacaact tgctttgtgg ttgagatggt gatctacctg agggaatgtt ttatatgcat 74040 
agaacagata aagtgtattc ttcaattgtt gaatataatg ttctatatgt gtcatttaga 74100 
ctaatattga tattgttgaa gtcttctagg tccttattaa ttttttatct attagttata 74160 
ttaattgctg aaaaagcatg tttaatgtct gtgattggag tcttttctgt aattctcctt 74220 
cagtttttta aattgttaca tttatgtagt ttaaagcttt gtgattatat ttatccacat 74280 
atttaattgg aaatttttgt tgttttaaaa atgtttaccc tttctggggc tttttcttcc 74340 
ttccaacctg gtatttttcc tcttcagcct gaagaagtta cctcagcatt ttttatagaa 74400 
caggtctgct ccagacgatt cagatttttc tggtgaacat ataaaaatat acagaaatgc 74460 
ttttaaggca ttgtctcagg atatttgcta tcatgtttac agtcttttat cttctactat 74520 
ctttcaactt tttttttttt tttttttttt gagacggagt ctcactctgt cgcccaggct 74580 
ggagtgcagt ggcgcgatct cggctcactg caagctccgc ctcccgggtt catgccattc 74 640 
tcctgcctca gcctcccgag tagctgggac tacaggcgcc cgccaccgct cccggctaat 74700 
tttttttttt tttttttttg tatgtttagt ggagacgggg tttcacagtg ttagccagga 74760 
tagtctcaat ctcctgacct tgtgatctgc ccgccttggc ctcccaaagt gctgggatta 74820 
caggcgtgag ccactgcacc cggccatctt tcaacaattt aaaaacattt tatggtcttc 74880 
tagtttcatt catcttatga gcacaatgcg tgttttttat ctcctccagc ctctttcaag 74940 
attttctatt catgtttggt tttcatcacc gatttgatga tcgtttgcct agttcttgtt 75000 
atatttgaat gtatttactg tttcttatat atgtgagttg atatatttca tctaatttag 75060 
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aaaattgtca 
acaagaccct 
tctccttttc 
tcacccaggc 
tcaagtgatt 
gcccagctaa 
cgaactcctg 
catggctctc 
ctccctcctt 
gtttactgat 
catctcaata 
ttctctgcta 
tctgacatat 
tggaactgcc 
cctataatgt 
ttaacagggc 
ccagctgctg 
acccagcctt 
cttgagtgga 
tctacccagc 
ctgctcccag 
ccaaacttgg 
ggtgggtgca 
aagcttttta 
cctcctcctc 
ggaagagatc 
ttcattggct 
tcatttgtag 
tgtttataac 
gaccagaaaa 
cactgaggct 
tttgccttcc 
aacacctgta 
tcgagatcag 
aggtctggtg 
ttgaacctgg 
gcagcagagc 
tggaagaagt 
gaatctttat 
ttttaaaaca 
ggttttcaag 
gataattatt 
gcaaggaaag 
caaagaattt 
tctcaatatg 
tttaaacaaa 
gattagcaac 
gcctacattc 
aagcaaaaag 
aactgggtca 
tgttaggcca 
atcagttgtt 
ttatttgaaa 
tgttgccttt 
cagtaatatt 
taatttaagt 
tgtaaaatta 
gaagtattgt 
aaaatataag 
agaaaaatta 
ttacaatctt 



cccatgttct 
taacgtaaat 
gtttttattt 
tggagtgcag 
ctgctgcctc 
ttttgtattt 
acctcagatg 
ctttccttct 
ttttttttct 
tcttttcttt 
ttttattttt 
aaattcttca 
ctattatatt 
tctatttctc 
atatatatat 
atgagatcta 
actccctgag 
ttcacaactc 
ctcagtgaag 
tttccgcaac 
cctcaagtca 
gtgcattatg 
gagctgcctc 
aaatgtgttg 
ttcccatcac 
atgactttct 
cttataaact 
ttctcttatg 
atggtccatt 
aaattgtaat 
gtgttgtttt 
aggggtatca 
atctcagcat 
cctggccaac 
gcacacgcct 
gaggtggagg 
aaaactccat 
ttatgaggta 
gtcttatttc 
tttgtagatc 
gaatttatat 
tgttaatgta 
acccaaagaa 
ggtgatatga 
aattaggaaa 
tgatgagtct 
aatggtgtca 
aaaacaagaa 
ccttcctaga 
tgtgtccgtg 
attgtatttc 
atcactctac 
agttatttga 
aaaatgataa 
aactattcta 
aaggaaaagt 
tcaatgtttc 
taacatccgt 
cacatttcaa 
ccaaatagca 
ataataataa 



cttcaaaaat 
atatttgatg 
tattttattt 
tggtgcaatc 
agcctcccga 
taatagagat 
atacacccac 
tctcttctgt 
tatatttcaa 
ggtatgccag 
tttccctatt 
ttttttttct 
taatataaaa 
tttcttccct 
agcttcttca 
agcaagggag 
tccgggcttc 
agatttctac 
tccccgggag 
agctttttgg 
gctatataag 
tccatgtact 
tgatggagct 
aaaattggac 
tccacctagc 
agattttctt 
gtgttttgtt 
gctttctaat 
tgcagtttat 
ggcagatgtt 
taatcttact 
tccatataaa 
tttgggagat 
atggtgaaac 
gtagtcccag 
ttgcggtcag 



ctcaaaaaat 
tcctaagaca 
gtccacagct 
aagctatctg 
taagattgat 
gatatttcat 
agcatgtaag 
taaaatccaa 
gccttcagct 
tttctttttc 
aggactgaaa 
gagggtagga 
agcctcttgg 
tctaggtgtg 
atctcttggg 
ctaagtgttc 
tattcaatag 
atagcaattt 
ttctcataaa 
agagtttatt 
tttcttgaac 
ggaacttaga 
atttatgtga 
tattgtatta 
cagataattg 



gtcttctacc 
gcataatatt 
tattttattt 
tcggctcact 
gtagctggga 
ggggtttcac 
cttggcctgc 
atttttctcc 
tgtgggtgat 
tttactaata 
cttgacattt 
gaaaaattgt 
ctcctctctg 
tgcccatatg 
tgtcttgtaa 
agcctcggat 
cagtgaattc 
gctttgctgc 
ccttggaggt 
cttagagggt 
cctccatttt 
aaagtaaagg 
tcctcaggga 
ttgtttctca 
atcagggaag 
tcagttaggc 
attgttgttt 
tcctaaccag 
aggggtggta 
atgttgtctt 
gtgcatggca 
taaactgctg 
gaggcaggtg 
tcgtctctac 
ctactcagaa 
ccaacgttgc 
aaaagtgaat 
tgtacaaata 
ccccacccac 
gttctaaaac 
tttcaaatcc 
agagacttca 
tagatatcct 
tttctaaata 
acacgtaaaa 
cttaaatctc 
gctagtgatt 
tacagcgtca 
cagtcatata 
agagagggat 
aatgggggga 
ctctgaaata 
ttataatggt 
aatatctaga 
tcttgtatca 
tcccagtcac 
cactgcacct 
aaaaatacca 
attttaaata 
agaatgaaat 
tatacatgca 



tcattatctg 
ttctacacat 
ttgagaggag 
gaaacctcta 
ctacaggtgc 
catgttggtc 
caaagtactg 
ttcttcctct 
ttctattgat 
gacctgtcca 
ggctttttag 
ctaaaaatag 
ataatttcaa 
gcatagttat 
ttttttacga 
ttctctttgg 
tctttgctgt 
tcagctctga 
gtttcccact 
tttctcatgg 
tcagatcccc 
ccccctggga 
cctgagtcat 
catccctttg 
cacttcattc 
tttttttgca 
tcaagttatc 
aagtggaaac 
tatgaatcaa 
aagccttgct 
ccttctcatc 
gaaggccagg 
gatcacttga 
taaaaataca 
ggctgaggct 
tccactgcac 
aaaaaataaa 
ctagtgtaag 
accatggatt 
atttgtagat 
tgtgtgctct 
ggcaaacaca 
aaatcaaatt 
atgagttttt 
gaaaatctaa 
ctggaatcct 
gcttttgctt 
ttcattctgc 
cttaacacct 
ggaaggtggg 
ttgatattgg 
tattacttga 
aatccaatac 
gtaatctcat 
tttttttgtt 
aaatagaagt 
tgaattttga 
gtgctaacca 
atctgttttc 
gttctgtaat 
atattaaagt 



tcctcttttt 
ctcagatggc 
tctcactctg 
cctcctgggt 
ctgccaccat 
aggctggtct 
ggattacagg 
ccatctctcc 
ctatcttctt 
attaattccc 
ttgtgtttat 
tgttagatta 
catctatatt 
gtttatacat 
tcccttccat 
gctttataac 
ctggttgctc 
gcctttgagg 
gctgtcctac 
ctctcctgag 
tactgtgccc 
cagagaggtg 
cacacttcat 
ctggcaaatc 
ctctctcata 
tctccagctc 
catcttgttc 
tgctagctat 
gtgtggaaag 
ttgcttttgc 
acactagtgt 
cgtggtggct 
ggtcaggact 
aaaatgagcc 
ggagaatcac 
ttcagcctgg 
aataaactgc 
acccaactgg 
taagaaagtc 
caagctatct 
agtctaggag 
tgaatgagga 
gttatttgac 
ctgttaaagc 
ctcatagagt 
gttgacattg 
tagacttcat 
atcatcagga 
ttttggcaga 
aacatggagt 
gtagacaaaa 
ttctggaact 
aaaatatata 
ctacatttga 
aaattgtcat 
aaccggttgt 
ttaaagagtc 
catcacatct 
tttggcttgt 
caagatcaaa 
atatgagcat 



75120 
75180 
75240 
75300 
75360 
75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
78600 
78660 
78720 
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attctttaga 

gttcactgaa 

tttcttttac 

ttgaaaaatc 

gaattagaag 

tggtttgctt 

caggatgaca 

ggatgtgtgt 

tcttggcagc 

ggggtaagtc 

acaacctagt 

ggtttctttt 

tcatgtgtgt 

ctagggtgac 

gataagacag 

gcctccaaac 

aacatttagg 

gtgtgtgtgt 

atgtagatcc 

ttgtcacaat 

taatgtaaaa 

ttaagctgtt 

tagatatttt 

aatagatttt 

gaatgcttgt 

cttcttcttt 

aaaaatctga 

tataatattt 

acacatggtc 

tctagtggcc 

atgagcatta 

tatgaacaat 

gacagaaaga 

accttgagga 

ttgattttga 

ctcaaattta 

cttctacaca 

cataaaatta 

cccttccctc 

agagaaacta 

tcaggaaatc 

atgtttgtgt 

aaaaaaaaaa 

tcattgcaat 

tctatttagg 

ctattaggac 

aatatgcagt 

tataatactg 

gaaaaatgtt 

cagagctcaa 

atgtccttca 

tgctaggttt 

atatttgata 

ttgacctggg 

tgcaacacag 

gctcctctcc 

gctcagaggg 

caccattcac 

ttactgttgc 

cattgcataa 

taaaatatat 



caaaatatcc 

taaatgaaaa 

ttcagtatga 

tggcataatt 

agttggtctc 

ttattactgt 

tttctatcac 

gtatttttgg 

ctccttggta 

tcagtcatcc 

tctgtcaagt 

ggcaaaagaa 

acgtaagaac 

ctggtgagaa 

aatacatttt 

caatataatt 

gtttttttcc 

gtgtgtgtgt 

tagtggtgat 

aaatttatat 

gagttaaata 

aggcatgaag 

tattatttgg 

aatcaatcac 

aaagcatttt 

taatcgtttc 

attcatataa 

taatgtcaat 

gttcttaagc 

ccgagctgca 

acccaggcat 

tcacagtgct 

gaaacaaaat 

gactgctact 

catagccttg 

aacacatttc 

ctaaaccttt 

gaaggggttt 

ttaccatccc 

atcaaaagcc 

atcatcatta 

attcagatcc 

acgtgcaaag 

aataaattat 

aaagcgatct 

ttagtaatga 

gtattaaatc 

aaccagttgt 

taaagctctc 

aggctgctct 

gtttcacctt 

taactaaata 

aagggaaaat 

tggcactcca 

agtaaggtcc 

attgctcacc 

taggagctga 

cccaacaggt 

tgtgaagttt 

ttcctgtact 

taatagaaca 



aaagacatca 

tattaagaat 

aaagcatgca 

taaaagcaac 

tgtttcttga 

tttcacttct 

attatggctt 

agacggtatt 

taaccacgca 

gaagccagtc 

ggaaagtgaa 

agagaggggg 

tggaagttct 

cagacaaccc 

ttctcactgt 

agccgtatgt 

cagtgatcaa 

gtgtgtgtga 

cttaaatgtt 

ggagagatat 

atatagttct 

gaaaataatg 

tattgtcatg 

tctctttatt 

ctaccaatat 

taaaataaca 

ttttacctaa 

gtgtgtacat 

gacagagaca 

ccccataaac 

tcgaaggtcc 

ataatttttc 

ataggtgcac 

tggggacatg 

gtgttgggaa 

cttttctgta 

gagccttttc 

tgatgtcaga 

atacagatga 

tccaacagtc 

tcatcaaata 

taaagagggc 

gtacacaagc 

ttttaaaaga 

tataattaaa 

ctatcaaact 

tgaaacaaat 

ctctgttaca 

ttctctccca 

caaatgttca 

ccagtcaagg 

aagcaattta 

ttgccagtcg 

gtctggactt 

cacgtcatcc 

ttcacgagaa 

cccagcacct 

gggaaacaat 

tgatttggcc 

atagtagtag 

tacaatggta 



atgttttcat 

ttggttatat 

aaaatgataa 

tttccatctg 

tgtgaatgat 

tttcttggta 

aacaaataaa 

tgttgcctac 

tttacggaag 

ttggtaatcc 

gagagctgtg 

gaacagttct 

catgcccact 

ttgagccacc 

taaactcttt 

aaatattctg 

aaatacagca 

tattattaca 

gagtatttga 

tttaattata 

gttgataatg 

ttgtaagtaa 

ttgtttcatg 

ccaaaggttt 

attaggacca 

tttccaaaaa 

tgtggaagat 

ctgtaattaa 

tagaaatgta 

tctttgatag 

agataaatca 

ctaagttaag 

tctcctctgc 

tactgattca 

attcttccct 

atcagtgagg 

ttcagaataa 

ccatcctttt 

gcaaaaccat 

acacattcct 

gcattttcct 

cctagaggcc 

atcactaggc 

cattccagaa 

gaatatatcc 

caagtctaat 

ccatactaag 

aagtgtgtat 

ttctggctcc 

aaactcccca 

aattcccaat 

attgaaggta 

aacatgctgc 

caccttacct 

attcactcct 

gttccctgta 

gcagatcctg 

ttgatacaca 

tcctaaagta 

gtactatcta 

ttggcataca 



tattaaagag 

taagaataaa 

ctacattttg 

gctggtgcag 

tgtgattagt 

tatgtgtgtg 

acagaaaata 

tcaacagtgt 

atgacgttac 

cagcagacta 

ctgggaggct 

ttttcttctg 

gagcatgggg 

ttaccctagg 

tttattattt 

aaatctctgt 

tctcaagcaa 

taataattgg 

ctttttttag 

gaaactcatt 

atattttgta 

tcagttttat 

cctcataata 

aagcagataa 

tttctggata 

cctcaagcaa 

tgggagagca 

taaaattagt 

acagcacatt 

ctcttaacaa 

ccatgctctt 

actagactaa 

tcattatagg 

ttttctgact 

caaagcctca 

gaagagcagc 

atgatcctac 

tctgccttta 

cctattttta 

ctgatgagca 

aacacccaca 

cagaaaaggt 

tctttgcaag 

actccagggg 

attcattttc 

gaccaaagat 

gttttaaaat 

gtattgaggg 

tcctttatgc 

cccatggctg 

ttctgagtat 

agccttccct 

ttgatgcggt 

ttaattgctc 

tcctcttttt 

accgtgaagg 

aagggcccct 

ggctcaaaag 

gaggagccaa 

gcaagacttt 

gctcttttta 



ttactttcca 

ctctaagatg 

ttatgatggt 

taagcaagtg 

ctgatttctt 

gatatgcata 

tgtttacttt 

tctttttact 

cagcagccct 

gcaagggaac 

tctcagaaaa 

ccggacaagg 

agagtcctgt 

aactcctggt 

gtagctgggt 

caattcaaat 

tattgtgtgt 

aagacttttg 

catttaataa 

agtttgtcca 

atagatattt 

atagggttta 

gctgattaca 

agaatacaga 

aggtccttga 

aaccacttta 

gtgactttaa 

acaattcaca 

aaaatcctct 

atgcaaatta 

caggaattca 

tgttgagaga 

gttctcttcg 

ccttttgtcc 

atcgctcatg 

tggtcaccat 

ttcttagaat 

gacaagacta 

aaagagatca 

acccttactt 

tgtgcaaaat 

tggaaaaaga 

agagtctagt 

attttcattt 

ttttgtttaa 

tgagaacata 

ttgcaacacg 

gaattgtctg 

tcctggagaa 

gaatggctct 

tactgatttt 

taatttataa 

gggggggatt 

tgccagggga 

gggtttctca 

ttatcacaag 

aatctaaata 

tttgttttga 

tttgatattt 

actgcaaaca 

ttttttatct 



78780 

78840 

78900 

78960 

79020 

79080 

79140 

79200 

79260 

79320 

79380 

79440 

79500 

79560 

79620 

79680 

79740 

79800 

79860 

79920 

79980 

80040 

80100 

80160 

80220 

80280 

80340 

80400 

80460 

80520 

80580 

80640 

80700 

80760 

80820 

80880 

80940 

81000 

81060 

81120 

81180 

81240 

81300 

81360 

81420 

81480 

81540 

81600 

81660 

81720 

81780 

81840 

81900 

81960 

82020 

82080 

82140 

82200 

82260 

82320 

82380 
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tttctaatag 
ccaatgaata 
gtattattga 
gctatcacat 
ttaaagaagt 
ggagtgcata 
aatgttaggt 
tcctattgta 
gatctcgtat 
cttttaagat 
ctaaaagctg 
tacatggaaa 
tgtcccagca 
gccaggcctg 
aagcacaggc 
ctgtttcaga 
attgtaggcc 
tttaggatgt 
ttgaaataag 
ttctttgtat 
atgacaatac 
attattacat 
cttaatgtaa 
atcaatattt 
gaagtatatt 
gttcatgatt 
agtattcaaa 
atgaatatat 
atattgtcat 
tctttgaaaa 
caatttgtct 
aatcctatta 
tcattgtaaa 
cactaaatgt 
aacagtcaat 
ccgatgctag 
gaaactggaa 
agtaacacag 
ttatacccac 
aggtaaggtt 
aaccctcctt 
attttattaa 
aaagagttat 
tccaatacag 
gacctttcgt 
attcttctat 
agaagaacaa 
gaaggagaag 
gaaacagaaa 
gcactttact 
acatttgtgt 
tcatttccca 
tgagcatgaa 
tataattatg 
atatttctag 
gtttgttcac 
aacaatattt 
attagggttt 
aagctgtcct 
agtactccag 
agaatcaatg 



atatgtttat 
acatgcttaa 
atatctcata 
ttaaacctaa 
tgagcctcct 
tgtgcacaat 
ttctagaatt 
atatgcagaa 
ttccacagag 
tttccgatgt 
cacttaatag 
tattacttcc 
gcatccagat 
agctcaagtc 
gccagctgcc 
ttgcaaagga 
tccataaaca 
aaaggttaaa 
acatgcggaa 
gggtaagtag 
gccataatca 
tgcagagata 
tacacttcta 
ttgattcata 
gaataattca 
aggcaatttg 
agaaactggt 
aggaagttct 
tttgattgta 
cagtttcaca 
aagtatttga 
atgagcataa 
ttcaggtagc 
caatcaaatt 
tctgtttcat 
aaccatcctt 
ccctaggttt 
aaattctgtg 
accaacaaaa 
ttacgattca 
taaagggatt 
tcaaatccat 
t tcacaagca 
gtttaatcct 
gaaacatgtt 
acaatgacac 
aaaggtcact 
aggccattta 
aacatgcctt 
gttccgttat 
tataattatt 
aagtcttgca 
atctgttttt 
aattgaagac 
aaatgatttt 
ttaaatgcaa 
tccaagatta 
cttaatcttg 
gtgcattgtg 
gttgtgacat 
gtctgtatta 



tctatttcct 
atattttagg 
tttgaaagga 
ttacatttca 
ttacgataaa 
tacataaata 
tatccaaaca 
gtttcaattt 
ctactctgtc 
ttatctgttc 
cttacttacc 
ctctgtgcca 
tttatgaata 
tcaaggctga 
ccacctccat 
gggtctctga 
aaaataaaat 
tatatatgta 
gcaaaattca 
tatggctaag 
ttgtccttat 
tgagacttaa 
tcaggaatag 
tgaatcatta 
gaattaatta 
gaaaagacta 
aagttcatga 
tgtctttgag 
tttcaatttc 
aattaagaac 
ggtaattttc 
atggctcccc 
atccatagtt 
catcagcatg 
tgttatgtaa 
gaaacagatt 
ttcttaggca 
gtcagctagt 
ttgtatctaa 
gtgccttagt 
taccatcacc 
aaggcagtca 
gatgaagact 
gaattttccc 
gctcagaagt 
atacctggcc 
cttaccattc 
tcattatagc 
taacatagca 
ttcaattaac 
tcctttgaaa 
gctgataagt 
tttttccatt 
taacctttat 
taagtgattg 
accctctgag 
tttgaactta 
gcatcattgg 
ggatttgggg 
cctcaaatat 
atttaactgc 



aaacgtattc 
gaaatggtaa 
aagaaagtga 
agacaagaat 
ctgaactcaa 
atttggttta 
tcatgatgat 
taagatataa 
attaagtgta 
tctgtatcag 
gaattagtaa 
cagactgggc 
ccatttccga 
tgttgttaat 
ggcacagccg 
cttctgtttg 
gaaatcgaat 
tctatgaaga 
tttttctttg 
tagtatgttt 
ctattaacta 
gacttactca 
tacagaattt 
tttggatcag 
gtttccattt 
tataaaataa 
gtataattgt 
aatacatggt 
gttatgtgta 
tttaaacaag 
atcctcatct 
aagtgtaaat 
gatgacagaa 
atacttttac 
atttatgctg 
agatgattgg 
cttcttaaat 
ataatctttt 
aacattggta 
ccccttaagt 
tcttgtaaaa 
atacattact 
acatatgtaa 
attttgaaag 
gacatcaaac 
ttcaaaatag 
ttgaggattt 
ctttttgatg 
gctgatatta 
actcataatg 
gctgaggaca 
gatggttcta 
ccattgaagt 
ttaaattagt 
atgagtaaaa 
ttaaaatttt 
gatacaatta 
cttttaagtg 
catcctgccc 
ctttagaaat 
aaagacaagt 



aagacttaaa 
aatggtaaac 
cacttgctgc 
atttatcatc 
atgtaagcat 
agaatcaaca 
tgactattta 
tcgttatgtt 
cctaaattct 
tctcgatcac 
tttgggaatt 
tttgattact 
tgctttgctt 
atcaggacag 
tcatggagcc 
cttttgtgct 
tacaacttct 
aatgcatgtt 
tatgggtaag 
tctttgtatg 
aacctcaatg 
aaaaatttta 
tatctgagat 
attttgatcc 
aatcagttca 
tattctattt 
ccaagtgagc 
ttaagtgaat 
aacggaagca 
cttttatctt 
ttttatttgt 
atttgaaggg 
tgacgggaag 
atgtctgcac 
atactcaatg 
attcttttta 
atttatggtt 
tcaagcagtg 
tctaacctag 
catttcggcc 
actggatttg 
tcttttggaa 
ctctacttcc 
gcttgccaag 
gtgatcctaa 
agtaaactaa 
cgaaaatcaa 
tcaaagaaga 
gtagctttag 
ttgttgttca 
ggagggtcaa 
ggagtctatc 
gtgtatttac 
gcaatcagac 
tgaagggtaa 
ctcaataatt 
ctgaatttag 
gaataattat 
tctacttacc 
tgccaaatca 
gctgtttcta 



attttcacta 
agctctatta 
tttctcatac 
tttttttctg 
ctctaattct 
aaactaaata 
ttgccgattt 
ctccttattt 
tctccatagc 
tgtcttcact 
accgtatagt 
tcgatttcaa 
aaagttaggg 
gcgggttcaa 
agccacaggt 
ctaggaagac 
ttttaaccta 
aaaaagtggt 
tagtatgttt 
ggtaagtagt 
agaaagataa 
aaggctattc 
atttgtatta 
aaataaatgt 
atccttctga 
gaactatgga 
tctgtaatct 
tctattatct 
atggatgtac 
ctatttaggt 
tatcttgtcc 
aaagattggg 
aaactttcct 
agctggaaaa 
taatcaacaa 
atataattct 
atccagcata 
tatgctattc 
gaaaaatctc 
acagccttaa 
agcaagatgc 
tgaaaaagga 
ctcggtaggc 
tcagttctca 
tttttgacag 
aactcactga 
tgatcctaca 
aaaagaaaaa 
aatttacaaa 
gccagtctta 
ataacttgta 
tccgaaagat 
taactgggta 
tatatccaga 
aatatttttt 
atgtagttca 
gcatatatac 
ttgttttagg 
agatgctaac 
tacttaggtg 
aatattacac 



82440 
82500 
82560 
82620 
82680 
82740 
82800 
82860 
82920 
82980 
83040 
83100 
83160 
83220 
83280 
83340 
83400 
83460 
83520 
83580 
83640 
83700 
83760 
83820 
83880 
83940 
84000 
84060 
84120 
84180 
84240 
84300 
84360 
84420 
84480 
84540 
84600 
84660 
84720 
84780 
84840 
84900 
84960 
85020 
85080 
85140 
85200 
85260 
85320 
85380 
85440 
85500 
85560 
85620 
85680 
85740 
85800 
85860 
85920 
85980 
86040 
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tggacaattc taaaggttta ccataaatta gcagaggcat taatctttta atgatagtat 86100 
tgtagtggat atactggtac aggacatagc tatctaagct aattctatat atacagtaga 86160 
ccaaaactga cacattccag gcatcccttc taatcttcct ttgactcctc ttatctagag 86220 
tgaaaatttt cttagcttat tttgcatctt cccagcatat cttcactggg agaaaattcg 86280 
tctctcacat ttttgcacat cttgcaagtc aggcactgac cacctttttg ttctggattc 86340 
tcttttgagg atgtttatat agcaaacatc cttggaaatg aagaatcatt tctgcctgtg 86400 
aaacagtgtg caaacatact tactccccat tatagaacat tcaggttccc taagttcagg 86460 
ttcctcttcc cctataaatt cctttgcccc ttgtaaatta ttgttttaca tgatcctctt 86520 
tgtgttgcca tgtgggaaca aaatgttgat actctggata ctgctgttac taaacttttc 86580 
ttgatctctg tcccaggagt cttctcattg cttcctacta gaattcatgt atccatatta 86640 
ggatacattt tttgttttgc cagaaaggca aaatccccag tccttctcat tactatgttt 86700 
ttgttgtttg tttgtttgtt tgtttttatg cagcctcggc gctctgtcac tcaggttgga 86760 
gtgcaatctc agctcactgc aacttctgcc tcctggattc aagccattcc tctgcctcag 86820 
cctcccaagt agctgggatt acaggcgtgc accaccacgc caggctaatt tttgtatttt 86880 
tagtagagat ggggtttcac catgttagac aggctggtct cgaactgctg accttgaaaa 86940 
tctgtccttt gaaatgaatg acaaattaca tgctatttat gccccaaaat gctaacttcc 87000 
acatatatca ttttagtata tccttataga ttatctcaca tgaaatttct ttttgcaccc 87060 
attgaaagat tattcctgag gatgtcatcc actctatcac ttattgtttc ctctttatat 87120 
aagcatattg tctttttaaa ggaaagctga tttttaccta aacaacagtt tcatttcttt 87180 
atagacatca caaaagttgc gagttactgc atttgcatcc tgcctatctc ttttaggaaa 87240 
tgaaggcttc cttttttaag ccaacctcta gcagctgcat aatatcattg atagtaatgt 87300 
tgggtgttaa ccttaccact gatttattac ttttctactt ttaaataatt gggtatctga 87360 
ggaaatggaa ataagatata tattaaatat aggtgtgtgt atatatatat atatatatat 87420 
tcaagtatgt atagtttcat gccaacctcc atattctata tataaatata tattacatat 87480 
tatattatat atatgtatgc gccaacatct taaaaagttt acatagaaag acatcttgga 87540 
ataaattgct attattagtg gagtgggggg aagaatcaca tgagcaatct aggtagcaat 87600 
atatctaccc tacatgaaga agatcacatg atctttggtg tttctcattg agtatccggg 87660 
ttagtagata acttattgac acagggatcc cagagatcct cagggaggaa agtgattgtg 87720 
accccaacac aaagcttggg aaagtgatgg aatggactag agtggcactt gacaaactat 87780 
tctgagagta gcactccagg agcatttgga gaattcacaa ggatggtaac attaataaca 87840 
agggtggctt gatactagtt ttcaacacaa gctctggccg tcctaagaaa aggctttttc 87900 
tggccaaaag tgttgcttac tctggaaatc cctaagcact ggccattctt gcatgaagtc 87960 
tttcttcggg aaaacatgcg taaatgattg aatataaatt caaagaattt taattgttga 88020 
gcccaagtat agagtgactt cactcttctc tggggactga agtggaaaga ccaagtgtcc 88080 
ttcaggatgc gtggactcca tattccacaa cattgtgtgg cagcatcaga agccactgcc 88140 
ccagtcggtg aaaaccaccc tctccacctt ctccactcta tctttctata aacaggatga 88200 
ctgattccca tacactgaat tgaccctgga attcatggac attgtggttt ttgtttgttt 88260 
ttgttcggtt ggttgtttag ttgattgaga gaagtgaggg gaaatgataa tatcagatgg 88320 
ggagaataaa ttgattaaaa tttcttaaaa ggggccgggt atggtggctc acacctgtaa 88380 
tcccagcact ttgggaggcc gaggtgggtg gatcacctga gatcagaagt tcaagaccag 88440 
cctggctaac atagtgaaac ccagtctcta ctaaaaatac aaaaaactag cagggcatgg 88500 
tggcgggtgc ctgtaatccc agctgctcgg gaggctgagg caagagaatt gcttgaacct 88560 
gggaagcaga ggttgcagtg agctgaggtc acgccattgc actccagcct gggcaacaag 88620 
agtgaaactc catcacacac acacacacac acacacacac acaaaattaa ttaaaaggta 88680 
ataaaagaag cacgatcatg cagagcgtta tgctggctac ttttacattt ttacttttac 88740 
attactcatc tatgtttcat tattttatgt gagtttttag atctacttag gcatttgtag 88800 
gtcaggtttg tttaatagtt aaattcatct acaatcaatc ttactttatg catatatgta 88860 
tttatagata cttagaatga cattaaagat tttggtagct aaatctggcc agtacaaaaa 88920 
atatatatta tctctagcct aaaatggaat gttgtcattt atgagttcat tgttacctga 88980 
ctgtggtatc tgactaaaaa aaattatcat agtatagatt gctatttttt agaatgaatt 89040 
attttaagga agcacttcca tggaccataa tctcagagtt ggacaagagt tctaaaacac 89100 
aatgtagaaa aattgtcaat attgtcagtt attaatgatt tccagtgata taacacaact 89160 
aaataaagta aaataaaaaa atgatcccat ttaactggta cttctaggaa attcaggtag 89220 
gtgtctgagg tcactgaaga aactgggtcg gttaccatat ggtgtaatct aatctctttc 89280 
aattctgtaa gtctgaaata tatggtattt tcaatgcttt ttttttttga gtattacatt 89340 
aagtcaggaa acagtctctt ggcttgacgt ttgagaggca gtgcaaaaag ccttctactt 89400 
tcagtgctat ggcattgctg atcacaatat atctatgaca ttctgataca agatctgaga 894 60 
aataaacaga aaaagggtca gtcatttcag agaggtgatt ccattataaa ggtgggtatg 89520 
tgtagcattt tcttgagaaa cttttaagtt tttctttgat ttttctctaa gttttaagag 89580 
ccctaagctg ataaatactt aatgaatttt cagtaatagt agacaacttc cattagtatt 89640 
aaataacctt tgtgaatcaa cttgtataaa tcacaacaca aatagtgaaa acaaaacaaa 89700 
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gcaacaaaaa acgctaaact aaactacaaa gaagtcttga gtgtttggat ctgacccaaa 89760 
gtgaccatta aagttgtaaa ttttctcatt aaattgataa caatgtaagc cattctctac 89820 

atcaattgaa atgccaacgt acttctgagt aataacatcc cttccatgtg gtctctgata 89880 

ttttgattct agctttttac gagcatgagt atattcttta ttttgaaggc tgtgttgata 89940 

gatggctctg aattggaggg ctaccaaaat ctaaagggat tctttgctgc ttgtcatggt 90000 

tacagtcctt tttctcttca gtagcttttg cttatctaaa atgatgaact ttctcagtag 90060 

atgtgcaatt caatcgagtc atcttttccc agatacagga atggttatta atattctgcc 90120 

ctctcagttc tctatgatac cctttatgac cattcagaca caggcattca tacctcactg 90180 

aggacagcag ggacaggata tgctgctgtg ataacaactg aaccaatgac ttggcttcat 90240 

gctctctaag agctgccaac tccagtgcgc ccaataactt ccaccccttc actacttggt 90300 

ccagtggtca gatggtgggt ggatggggca ctgcaaggca ctgctttctg ctgttctgct 90360 

ctaactcctg accagatttc acaaacagcg tgtgctagct ttaccatcca caaatgcaga 90420 

gaacttttgg gggaatactt tgcacagccc tcaaccagtt ctttcctaaa atgtatgtga 90480 

agtcaccaaa cacattgaag gttctcagta gattttcttc cttctgtctt tcagtttggt 90540 

tacctcgcct ctttgttttc attgctttca gattctttat ttatagaagg gaaaggggaa 90600 

aagctgtata ccacagccaa aatttcaatt ttgaaaaaat tccaatctac agaaaagttt 90660 

caaatttata gaaacaaaag tataataaat actagtaaat acctttcaat taaattcacc 90720 

agttacaaat tttgccacat ttgctttttt ctccctcatt atctctatat ctatccatct 90780 

atctacctgg ctacatctct gtgtctgtct ttgtgtctgt atattacaca tacacacaca 90840 

gatgtgttac cttttctatc cttttctatg tttactctag aaatcatgac aatatagatc 90900 

atgtataaga aaatcagtaa taagttaata ttatccaata catagattta cttatactgt 90960 

aatttgtggt ttcaccgtag tttgctttgc tttgttttgt ttctgatctc tatattctct 91020 

ttcatggaaa gagaatgttt atctgtgtgg tgacttggtt tattttggtt tccgtaactg 91080 

aatttctgaa gaatccagag gagttatcat tagaatgttc tgcattctga atttgtctca 91140 

ttgtttcctt ctgtttagca gcaagtaaaa catttcttct gttaagcagc aagtaaaaca 91200 

tagcaagaat actatgtagg tgatattgtg cacttaaagc atttgaacag gaggcacata 91260 

attttatgtt gctccattac tgatgatgct ggtttaatga cttggttatg atggtgactg 91320 

ccaggtgact ccattatgaa ggtgggtatg tgtgtgtata tatatatgat gtaagagtgt 91380 

gtatacatat atatatgtgt gtgtgtatat atgtgtgtat atatatgtgt gtgacacacg 91440 

tatatttata cagtaaatat atacatatgt atatgtatca cacaacatta tatttatacg 91500 

gtaaatatat atatgtatat gata.catgat aagtgaagaa ttgtggcctt gtggcaatgt 91560 

gagcatcctt cttcctaaaa attttcatcc .agtagttttt atcatctaat aattggtcct 91620 

tggtaaaaac agttattatt atgtattgaa aaaaagatga tatttctaat tttgtcatat 91680 

tcttcttatt ttttaactgg cattctttga aaattacctt tctctttata tatatatata 91740 

tttttttttt catgattcta agttatgtat ttaaggtatt ttaaatattt taagcatctt 91800 

gaaaaattta aagtaggaca tgttttataa ttactgtgat ttcttatata aacatcacat 91860 

tttgataaaa ataaagagta tattacataa actcattaaa gtcttaaatt tatacactta 91920 

tctgtttcct ttcaaagcaa tttttgttaa aggtaggaag gagtaaaatc cactgtgcct 91980 

tgggttttca ttttctattc caactaagaa gtgctgtcca cagaaatgta tcattcagcc 92040 

actcagtcaa actttcctga gaacctgcct ccatcaaggg acttcgttag ctgtaaagtg 92100 

ctgactaaac ttgtgtttat acctccactc atgaaatagg aaatatacac acacatacac 92160 

acatatgcat tcgtgtttat acctccactc atgaaatagg aaatatccac acacatacac 92220 

acataggcat tcgtgtttat acctccactc atgaaatagg aaatacaaac acacacacat 92280 

acacacatac gcattcgtat ttatacctcc actcatgaaa taggaaatat ccacacacat 92340 

acacacatat gcattcgtgt ttatacctcc actcatgaaa taggaaatac acacacacac 92400 

acacacatac acacatatgc atttgtgttt atacctccac tgatgaaata ggaaatatag 92460 

acacacacat acacacatac gcattcgtgt ttatacatcc acttatgaag taggaaatac 92520 

atacacacac acacatacac acatacgcat tcgtgtttat atctccactc atgaaatagg 92580 

aaatatacac acacacatac acacatacat ttgtgttttt acctccactc atgaaatagg 92640 

aaatacacac acacacccat ttgtgtttat acctccacta atgatatagg aaatacacac 92700 

acacacatac atatgcaacc atgtttatac ctccactcat gaaataggaa atacacacac 92760 

acacacatac atttgtgttt ttacctccac tcatgaaata gaatatgtgc atataatata 92820 

tattatgtaa ataaaattat atatatttat ataatataca attttctaat atgggctgca 92880 

ggccaatata catagatttc tcctggctgc aggtgaccat gtcacttgca ttgattataa 92940 

tatattttaa ttcatggaat ttaaataaag ttttaacatc agagactttc acagagattc 93000 

agataacttg agatagtaca taatatgttt aaaaagctaa tttttttctg tgaatattgc 93060 

ttgtttccat agacgtttca ttctacaaag gttcaaaata taattttaac taacaaacta 93120 

tgaaggtaat tgtacccatt ctgcagtctg ttagcagctt ttgattagat tctcatagca 93180 

gaattttccc atatacagtt gctaatatag aagaaaaata tcatctcctt attggattat 93240 

attgttaact tgcttttgct taatattttg gcatcttttg gtaaatttct ggaatgaata 93300 

acaataattt acgcagaata atctaataaa ggatatttat agcctatgtg aaaataaaga 93360 



60 



WO 01/40493 



PCT/IB00/01507 



tacgcatttt agggactttt gcagagatac aaacacttgt tctgtaaaat atttgttaaa 93420 
cgtagcacat agggaaagca cagatcctgg agtgaaaaac ctacagcttg aattccactc 93480 
cagcatttat tagaccagtg acttggtcaa gttgcttagt ttctctgaac atcaatttac 93540 
ttgtccataa agagagagtg gtggtacata atagatattt tgacaggctg ttaaaagcta 93600 
gagatgaagc gtgtcgggag tctgatgcat agtacttcta ttaggcaagg tttttcagag 93660 
gaaaaagaac caataggatg tgtatgtata catatgtaca cacacacaca cacacacaca 93720 
cacacatatg aatgagagac atttaattta aggaactggg tcatgtgatc atggggctgg 93780 
caaatctgaa atattcagga caggcaggca ggtctgaaga caagggaggt tttaacactg 93840 
cgcttccagt ccaaaggcag tctggaggca gaattccttc ttcctcaggg aacctcagtc 93900 
tttttctctt aaggctttca actgattaga taagtcacgc acattataaa ggacaacctg 93960 
ctttaatgag tctactgatt tcgatgtgga tctcatctga aaaatacctt cgcagaaaca 94020 
cttagtctga ggtttaacca catttctggg tgctacagat aaaattagct atcacagaac 94080 
gtctacacag aaaggtactg acatttgtta aaaccataga attctattta catgaccaac 94140 
cagaatgtgt ctagagggct ccagtgcccc tgcagatttt ctatttgtgc tggcagtttc 94200 
ttggatagcg tctcctattg gctgctctta ctttctgtct ttccattgtt tctttgaggt 94260 
ttcataggaa cagtcctgtt ttagttaaaa gtgccgctgg ctcccagggt cccaaagcca 94320 
atgagtagat aatgaatggc aagcaatacc tgatcccata cagaagaggg aaaacccttc 94380 
tctcccacac ctgtgcctta aggtagcact gtttatttaa cagacttgca aaaatctacc 94440 
atgaaaacag tgacttcaac ttctaattta gcttttctcc caattcatgc tcaaaccaat 94500 
tcctcaaatt gagaagcaag gacgagaaat gcttcatcag ttaccttcat tacagtcgta 94560 
tctctcgagg tgcactaaat tcctttgtct tcaactgatt ctctcctctc ccaacaggag 94620 
atcaatgtac agaggaatag ttcctttaaa tccaggagtg catgcaacta agcaacttta 94680 
gagatgaatg gttgcatatt ccccggagcc aattctagtt gagctcaaat gatatattct 94740 
taaaaatagt aattctattt attttacaag tgctatgtaa acagagaggt aagtaccagc 94800 
attataattc actaaaaaga cctgattttg ctgcacaaac ataactaaat ggctacaccc 94860 
tttaattgca gaatccagat ttcctccaaa ctttatctca ttccagtcag gaaaacagct 94 920 
atcgtctttt tttttttcat caaatgcttt tattatcatg tttgaagctt ttgattaagt 94980 
tggcatcgta agaaaccctt tcctattttc tttcatgttt ttcttgacaa gcctgagtcc 95040 
gtcattctgg ctggtattac aggttcatgt ttttgggtac attcatttat catggagtga 95100 
gtcactttgc tgtttttgtt tccactgtat tctgaaaagc acccacagaa acatgacaag 95160 
aatgcaggct tgcatggaag aaattctcat gccgggtctg ctctgtctcc ccagtcagtc 95220 
tatcctgcat aacgtcgtct ttaaacaata gaaagcaatg gctaaggata aaagaatggg 95280 
ttcagataaa actgaatttt ctaccaccct gaatattcac tccatattta atctgaagcc 95340 
aaatatttta acataaatgt atctttattg tctttgtcct ggtcttccaa tagggragtt 95400 
tttgtacgtg gtacctcttt gtttttttgt ttgttggctt ctactatttt cttaatttta 95460 
aattctaagc ctccaggtgt cttgcttata aaaacgattc tggattttct acttttaaaa 95520 
tatggtttct taaccatttg cagttttgct aagaggaaag gagaatctat aaagtatgtt 95580 
tacatgaaag gaaaatctct gtataacatt ctatcaaaac tgtaataaag tgaaaattaa 95640 
ttattaatat tgatatgatt tggctgttcc ctccaaatct catgttgaga tgtgatcccc 95700 
agtgttggag gtggagtctg ctgagaggtg ttcgggtcat tgggatggat ccctcatggc 95760 
ttggtgctgt ccttccccac tctctcttgc tcctgctctg gctgtatgac gtgcctgctc 95820 
ccctttcgcc ttctgccatg agcaaaagct ccctgaagct tcccgagaag ctgagcagat 95880 
gctggtgcca tgcttgtaca gtctgcagaa ctgtgagcca attaaacctc tttcctttat 95940 
aaactactca gtcttaggta ctgcttggta gcaatgcaac aacagcctaa tacaagtata 96000 
cataatggaa agtgagccat gccatttgtt cctttgaata tatttggaca attaaaccaa 96060 
caataaaaaa aagaaattat atgcagtttc tgactcgggt tccttgaact taagtggtag 96120 
cagaagattc ccctatctgg aagagagaaa gtcttcctag gaaaatgcag gagctgtttc 96180 
aagtgtgtaa tggagcagca ttcttctgaa acacattttt gcttggaagc ctaacatata 96240 
aataaaaata tgaagaggga gcttttctat ttaaggtgtg agaagagggt gtcgggtgcc 96300 
cggttctacc tcgttaagct cctttcaaat gctgagatgc catcgacccc tgattctgtc 96360 
tcccaggaac tcagactcaa aactcttgca ccagaaacaa ctagatggag aaaactacct 96420 
gacaaaagtc ctggacgatg gccttgaaat caattccttc atgtctttag agggtctctg 96480 
tagtttttcc ataggtgctt accagccaca ttcttggagg cgagctggct aaagtaaaat 96540 
tctgaactga actgagcttg atagagttct gtgctcatat tcccagcaca gtctccatgt 96600 
tgggagggaa aattgctttt tctcccattg cagtaaaaaa aaaaaaaaaa agtcaatgat 96660 
caagtctaga ttccttacaa cttctgttag tttcctgcac tccatggata atcaaattcc 96720 
tcgcaacaaa acagatgagc tgtccctcaa aattgttgac ttgtgcctta ttaggttggt 96780 
agaagtgaga atgtggtatt ttaaattctc aagactgact tcgtgtatcc atagagaata 96840 
atgtatttta tcctattttc ttaactctgt atgtatttga ataagggcga tttaaaagtt 96900 
ctctcacatc agtagaaata actgagctag agtttgcttg ttgacaatgt ataggcacaa 96960 
tatatcatta gaagagtgat aatgactttt attttggtga aataataaaa aaggaatcat 97020 
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aaaaatagag ctgtctaata atgaaccatg tgtttaaaaa agaaaagtta actttctcat 97080 
cactggaaat aaggaatgag ccagtggcca tggatcacag atataattgt aaagattcct 97140 
atattgagtg ggaacacaga caaaataacc cccaaagtct ttgctaaatc tatggtcata 97200 
taaaagaaat acagctggat tgcctatcaa aaagatccag ggcagtggat actaactttc 97260 
tttacttgca gccaaagttt tgaaatctta gtagagcgtt ttgctttaaa tttaatttat 97320 
taaccctgat tactgcagtc agcaaattaa actcaagcct gtattatgaa atattatgaa 97380 
ataaacactt tagaactgtc atctgtatta agcagctgta aaaaagcagt gtcaagcctt 97440 
gatgatttca tataaaattg aaataatttt gaatattcct catggtgatt tgtatgtttt 97500 
gttatgcaga cacagaataa aaacaagtga accaaaaaaa aatgttattt aaaaatactt 97560 
aagaaaaaaa acaatttgga aaaaaatctc gatatcggca ctaatttaac tttgtaaatt 97620 
ttccaatgta gttcatgttc agtatattag cataatattg attgtattag aggggctcag 97680 
tatttcttac ggacttaagt atgtttaagt aagagcatat ccattttgat taatgaggtc 97740 
tatatacctt ctctcttgat tactctccag ttttggggaa atcactttgg tagctgcatt 97800 
agctctttta ggtcacctga aaacttctta ataaactaag tggtacacta gtaaacaaaa 97860 
agaactcacg cggcttccct tgaaacgaat acatcactgg gaattctgag aaaccaatta 97920 
aaacagaatt attgttttga atgaatcatc agagaattct agtctaaact cttctaaaaa 97980 
cctacatttc tttctaattt tattacatta aatttaaaga tttttttcaa actttttgaa 98040 
tgagaatagt catcagatta ttttaaaaga ttaggctaaa atatgcaaaa ttcccttcag 98100 
taatacattt ttttaactgg aattataatt agtcataagg ggaaagtcat tttagtggga 98160 
aataaaattc aaaaaagcta aacattaact gctgaagttg ctttttgccc tttgcaaatt 98220 
ctttcttcat tatatcattc tctctatatt tttgaatgca tatatttcca aagaaattac 98280 
tttttgctta atataaaatt ttaggcaatt agatgcaaaa gtgtagtaaa gcaacactaa 98340 
agttgccaat ttaaacatag ggaagcaaaa cagcccagga gttaaagttc tgacagctat 98400 
cttaactagc acacatggcc agtgccactc agtacacaga cacacaccca ttttcacagt 98460 
cactgaggag ctctaagagt tgattttgaa acgtagtgga aattaccttt tttgggaaaa 98520 
taacataaat ctgagataaa ttcttgactg tccgtactca aaatattttt ttccaaaatg 98580 
aaaaaaatgc aaatgtcctc agcataataa tctaggataa gtagttaatg atgctacatt 98640 
aaggcattac ttggggcttg caaattcatg actgacattt gatgatatcc caggttagaa 98700 
gagttatttc ctttctttct ttcctttctt ccctccttcc ttccttcctt tctttctctt 98760 
tctttctttt tctttctttc tttccttctt tttttatttc taaaggaatg tgtaaattaa 98820 
taacaaaaac ctttaacata taaacagatt gataaattac tgagcaaacc aaactgtcca 98880 
cccatgtttt tccttgtggt agtctgctcc aaccatacca gcaaggccta gaaagagcaa 98940 
aaaaaaaaaa aaatgtatat atttggcggt gtgatatttt cttaaacaaa gatttttatt 99000 
gtaatgccta actatttctc tcatttgaat ctgagttgca atttagacac aaatcctgag 99060 
gtcagatcct ggcttatagc tggatactat agagcatgtg ctgtgttcct gactcctcat 99120 
ctatgatgta gaaatgataa atcccacagc agaatgttca agtgaaaaag aaataggaaa 99180 
atataacgta tcatgcataa ggcctcttat aaaaaacaca gattccaaaa tgacatggtg 99240 
attatcatca ttttattata gccaataaat tgccagggat tttaaaaact atcattctct 99300 
aagaattcta tttaagaaca tacatataga taaaatatgc attcaagatt ttgtaacagg 99360 
atctggaaat aaagtcaaca caaaactttt ctatcctgct gctggtattc aattttctat 99420 
tcttgaacag atattaattg tccacattcc ctactttctc tctcaagttt acttaaaaaa 99480 
atatctactg ggattatcta catggtgcca gacacccgtg aaggataggg tagtggagaa 99540 
aataaacatg ctcccgtttc tatggtgttt atcatataat ggaggacatt cttggcttgt 99600 
gcatttattt ggttcagcta tgtaagggat ggtaaaggga atatgttcat tacagaaaca 99660 
tattcttact ttctctccta attacagtaa atgttcagag gtatgggaat ggtcaaacag 99720 
aaaatggaaa attataataa aaaataagaa agttatccaa gtccacagta aaactataca 99780 
attttataac acagaccaag aaaatattcg catagaattt ggtatgcttt ccttgcccca 99840 
tcaaatccac cattttagtg ctctatattc tctttctgtt tttaacatat gtaacggcgg 99900 
ggctttctga gctggcttcc ttggagtgca gacattagta atacagacaa cttctattcc 99960 
tcttccccct acacattgct tgtaaatcag ttattattta tttatatgtt tttaattttt 100020 
tgaattatga cagagaagga gtagatgaat ttaggtagaa agataggaag gaattgctga 100080 
gccaaaactg aacaaagaaa ctggtcttgc catggggtta gtctccaact cagaataaat 100140 
tgaattttta tatagagtga cttgtcagat caggaggata gaaattaaag ctagaagccc 100200 
tcaaaagttg gagaatataa tagaagatta tacccaaatt aagcagaaat tcaggatctc 100260 
ctttcaggta tgtttgaatc aaaagcaaac cagggatttg caacacaaga tacacattgc 100320 
cttgggacgt tcagctaccc tcaaaccatg aattttgctt aagggtctca ggttggtaat 100380 
atcactaacc tgagtgtgac agagacaagt caatcaatct ctggaataag gcaatttcat 100440 
cataggcttc agcttattca aatcagtttt caaacatgat gggcagcata tactacaggt 100500 
tcaccaagca taggcagtga tttatctgtg aatgggaata aacagaaata acccattaag 100560 
tttacagata ttgaaatttg tagacacagt ctacaaaaat actatgtttt tagggaaaat 100620 
ggtaagtata taaactcaaa caagggacta tgtattaggt tatctaaaat atttaaaaca 100680 
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gatcaaagga aacttctata aataaaatct aaaaacctga aaccataaag cctatgggtg 100740 
aaattaacag tacattagaa ggagatgaag atagaatgag cgagttagaa gaaactcaaa 100800 
gaaattatac agaataaaac agaggaatta agagacaagg aggacagaat gaatggttag 100860 
gctttatatt gaatcagagt ccttgaaaaa aaagaaaaaa ggatgggttg aaataatgtt 100920 
tgaagaaata gtgcctgaga atattccaga gctgaggaaa gacacctgca ctcagattca 100980 
atgattggaa caatttttca gtagtataaa tacaaagaaa tgttggtgat tgtaaaactg 101040 
caaaacacca aaatcaactg aaaattttta atgtaacctt gaaaagaaag agaacataag 101100 
aaattagcat gataggagaa atatttaaat cagtaattga tttctcatca acagcaacgt 101160 
aagatgaaag agaatgaaat aacttcacaa gctgaaacat aataactgaa tctaatgctt 101220 
gaatttcttt cacaattata ttgtctaata atccaacaat cacattagaa attaaatagg 101280 
agcaagaata cttactgcag ccttatttgt attgctaaaa gcatagaaat agcatccagg 101340 
tcaataagta ggcatttcga taataattat gagatagcag ttcattgaaa tgttgggcat 101400 
ccattagaaa ggaaaaggca attcttaaaa atccacgtgg agttgtattc cagatgtaat 1014 60 
ttttgcaagt gcatagccat ttgcatacag aaaatgtatg tgtatgtgcc gacacatgta 101520 
tagattttgt ggtaatctct ggagagatac tcaagaaaat aattgtcgtt gtggagagaa 101580 
atgaggtgct gaagaaaagc agccaaggtt ggggggattc atactttttc taccccttgt 101640 
accttttaaa tcttgtttcg catgtactat tattcatgaa ttacatatat tttttaaggt 101700 
tggtttgatt ttagttttta attaaaaaca actaaaatac aaatatctcc acagttatca 101760 
acagcacaga ttaaggaata gtgttgtgaa gtgtcttcat tttaggggtg attgtgttaa 101820 
agttctgggg acttaaatga cctttttcat accatccaaa taccgattga gaatgtaaat 101880 
ccaaatctct ctacaaaatc catcctttta tttttactgt ttaatacctc cacatatttt 101940 
ttaaacagtt ctatacattt ggtactatat aagttgtatt gccatcaaat gaatattcat 102000 
gttcactcca acattcatac gtgtttttca acctatgggg tcctgggtag atgattaggt 102060 
catgagggca gagtcctcat gaatgaggtg agtgccctca tataggagac cccagaacac 102120 
tcccttgctg cgtccttcca ccatgtgaag gcccagcaag aggaaggcca tctatgaacc 102180 
aggaagtgat ccctctcctc tccagacacc aagtgaccct gcaccttgat cttgggcctc 102240 
gtaacttcca gaatcttgaa tgtatatctg tttataagcc aaccagtctc tggtattcca 102300 
ttatagcggc ccatacatac caaaagctgt ggcactaaca aaatccagca aagaagagtt 102360 
ggatctaatt gtttcattaa aatttgatta tgaatctttg ctacggccta aacagcagag 102420 
accagagttc tctccttcca ctgtgcgagc ttacaaattg gccgctaaga gacaacaaat 102480 
ctaccagcac cttgatcttg gaattcccag cctccagagc tgtaagcaat aaatttctgt 102540 
tgtttatcag ccacccaatt tatgcttttt cattagaatg gcccaaaaca agtaagaaaa 102600 
tcccccaaaa caaatcaagt gactggtaca tgcctatagg ctcagtaata tttttttgaa 102660 
tgaataagtg aataaaggta gagtgaatgt gtattgcctt tgaatgtaaa cctttaatac 102720 
caattgcagt atgatactga attcacttaa gaattatttt tacatagttg tgtagaagca 102780 
atttataata tggaaatatt tctcaaattt acagaaattt gataaatatt atatttattt 102840 
tatgtaaatt cccaaactaa aaagacatgt cttttctctc actgtccatg taagttagtg 102900 
gaagacatcg atctcaaatt acaatattta actgtattaa attgtatata tagatgcaga 102960 
ctctaggata aaagattaaa tgtttgatat ttttaagcca gatcctatta ttaaaatcat 103020 
caatgatttt tactgtgaag gtgtagccac atgacattac cttcagaaaa cttcaaagca 103080 
gcaaaagcat gtcgcagagc caaatgcagt tcccagtgcc aagaaaatag ctaatataca 103140 
tgggcatggt tccaaaaaaa cccatatggc aatgtgagtt gaacatcttt tacagcagca 103200 
attccattct aacccaagtt ctttccctta aatagcacct cttttatttc ctgtagtcaa 103260 
acggctgta cagtcctcac tcttgccaaa gtccttgtct gggatgtgca tctccaggaa 103320 
ccgaagtgct aaacctttct ttttactagc ctgctctgcc aaaatccatc ttgacacacc 103380 
ttaaattata ttagatcaaa ggctgaaaat tatatgccat caaaagcaga aagataaaat 103440 
gtttatctac cgcattaggg atgtgcaatg tttttaaagc acttgatgtt tgagtgagac 103500 
attattcgac gtctctgcag tgcagcatgg cttacttcct tggagtagtt gatagatata 103560 
ttgtaacagg ctggtggcaa atatgaagtg gcatattcag tatgtgttta ggatgtttat 103620 
gtgttctctg attcttgtgg tgctctgtct aattggaaat aaggtttacg tggtgagagc 103680 
agtgaagcaa cgacagggca atactataaa agcgactcct gccaccagca ctccatttct 103740 
tttaatggat ttctacatat ttggaaactt ttctaatttc tccttaaaaa ataaacccat 103800 
aaatgtcaag aagaatgttt ttcttcctaa tggagtttgc ttgccaagct ggctttgttg 103860 
caatgtctgt aggtgggcta cggtaaagac cagtggaatt gaagtaattt ttaaaggaat 103920 
attgtctcaa atgaaatata aacaaggaac tgtgtaccca ggcttcaaag tgataaacag 103980 
atgatgctat aatcaagtaa tttttaataa atatttgatg gaatttactg aagtccatat 104040 
ttcagtaatc catcatccac tcccagatat acattgagat tacaaaatta acaatttgtg 104100 
ctgttgccag aattaaaatt tcctagtaaa gttgaactat tatataacct tagtattttt 104160 
gcatccttaa ttttttccag attattttgc aggctataaa catgttaacc tttaggacta 104220 
ttttatgtcc aggtactact taaaaatgta ttttagatac aacaatcata caatattttc 104280 
taccgttatg gtttcacagt agataactaa aattatatat atatatatat tttttgagac 104340 
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aaattctcat gctgtcaccc aggctggagt gcagtggtat gatctcggct cactgcaacc 104400 
tccacctccc tggttcaagc gattctcctt gattctcctg cctcagcctc cagactagtt 104460 
aggattacag gcacctgcca ccatgcctgg ctaactttcg tatttttaat aaagacaggg 104520 
tttcgccatg ttggccaggc tttaacataa ttgattatgt ttcaaaaaat aagaagcaat 104580 
ttctgcatac aaatttcaga aaaattcact tggaaataag ttttattagc ttacactgtg 104640 
ggtagagaaa agtaaaacat tgaaaaggta actgtttgct ttaacattta cctgtaaaaa 104700 
cattcatgaa tttacctaca atttcttgga agtagaaata tattctcaat gttgttttag 104760 
aagcacatat ttaatggaga aatttgtttt ttatagtgtt atttttaaac ataattttta 104820 
atttagaaaa ttatgacaaa aactcgttca ttcccttttt tggaaaaacg atacttaacc 104880 
actagtaaat gttaagtcct gcattatgca caggggttaa taaaatttag gatgacatgg 104940 
ctcaggcttg tttaagacaa atagcactga taggctgttc cataaaactt gttatattgc 105000 
ttggaagaaa tgaggcccat atataattga atatatcaaa gttagtaaaa agacaaccac 105060 
cacgtgttta tccagcccct gcaatatagt gcattctgtg ctacactgtg aagatcgcta 105120 
accattatga agattcttac agtcatggga tgacaagact tctgtaatgc aatgattaca 105180 
gcactgtgtg atgaatgctc tggcctgtgt ggataaaatg ctttagaaac atgagagata 105240 
cctgcaaaag gtagctagta aaagcttcag agaagatagg acttctgaat ttgggtctta 105300 
acattctaaa aaaagaaata gtttgaccaa agacatggaa atttatttga atttgacata 105360 
ttttgtttgc ttgtttgttt gtttgacaga gatagggttt ctccatgttg ctcaggttag 105420 
tcttgaactc ctgagcaatc tgcctgtctt ggcctcccaa agtgctggga ttataggtgt 105480 
gagccaccat gcatggtctt gaatttgaca tattttgata tttgggtaag gggagaagat 105540 
tagtgcaatg gtttgcacat tggaccgctc caaacctcat gctaaaattt gatccccagt 105600 
gttggaggtg gggcctagtg ggaggtgttt gggtcatgag ggtggattct tcatgaatga 105660 
cttggtccca ccctggcctt aatgagtgaa ttctcacaat gtcagttctt ggaagtgatg 105720 
gttgttaaaa agagcctggc acctcacccc tctctcttgc attctctctg cacacatcag 105780 
cttcccttcg ccttccacca tcattgtagg tttcctgaga tcctcaccag aggcagatgc 105840 
tggtatgctt cttgtacagc ctgcagaact gtgagccaaa taaacctctt tcctttgtaa 105900 
attacccagg ctcatgtatt tctttattga aacataaatg gactaaaaca tttgatgtgg 105960 
ttgaagtgta agttttatgg aagagattgc caaggatgtg tccaaatggg gtggtgggta 106020 
aaagactttg aacgttttcc aaatgtatgc caatacatct agactttact tttgagctag 106080 
aggaagaaca ggtgcagcca aaattaattg gaaaagtgat agattgcatt ttgtttgaga 106140 
aggataattc tgctattaaa agagtagatt gcaatagaga gagtcaagaa gacaccagtt 106200 
tcagtggtca ttgtgaaaga ggacttggtt tcatggagct gggggaaaga ggtcagagac 106260 
catgggcagg attttgtggt caacttgaca tcgggatgat gtacaaagaa aaggcaaaga 106320 
ggaacaatct gggcagtgat gtcagtcgaa atttgggcag atcgttttga tagaatggat 106380 
caaatggaat ccaaataaag taaaattctc aattatgaga attcaattaa ataattttgc 106440 
tttaaaggaa actaagattt ttaaatgcca aagataaaag ctgaaagtca ttataacttt 106500 
agtcttccct aaaagtgact taagcaagaa aaaaaaacag tatatttttg aaactagtta 106560 
acagggtatt tggcttaatt taaatgatct ctccagcaga tattcctttt gtggtccata 106620 
aattttcagt tgtttattta tttttggacc atacagtaaa gggttggaca gagataaaaa 106680 
tcagggaaaa ggaaagtcca gacacagaaa ccgctgagtg ggcagaagca ctgtccgaga 106740 
tcaggggggc acaatgatgg aagtgcagtt aaggttgttt ccacgctgtc ctagatgtac 106800 
agagccaaga gggtgagtct ctcaaagcca cagttgagaa ggaggtcact tgagtagcag 106860 
gagagaaatg ctggcaagag aactgggcca caaacacagt ccaaagggag atcacggaag 106920 
cccatgagga aagtcaggaa ggagttactg caactcaata gctgctatca taaaaacaac 106980 
cctgctgcta cgtgggagct ggtgtcttcc tgtaaaatat actccaccta attatcctac 107040 
agcaagatac tgcatcagtg cattcccatt gccttgtaag tccgccttgc cttgtcttgc 107100 
cttgccaagg gtgtgatcca aaactgagtg ggctgatatg aattgggtgg tgctttcaat 107160 
tccctacata gtttgcctcc tgaaaacctc ctttggtatt tgggccacgg taatgcaaag 107220 
gagccattca tccaggtaaa ttggagctgg aggtaggggc tgagcctctt ctggggcctc 107280 
rgggggcttc atcatgggtc ctggcctgag tgcaaagata gcatctagta tcacagataa 107340 
cttctttttg atgcctagcc atatttctga ttttcttgtt tattaagagt gagaactaca 107400 
tttcctacta ggctaggtct tacttcgcag ttcttaaaat gtgttaaacc tatttgtggt 107460 
tctcctttga attcttccca agttctctcc acgcctggga agttgtggag ctggtcctct 107520 
agagatgcta tgtctatgat gccaacccaa ttacaaaaca caagcaaaaa caaacagcta 107580 
cgctagactt tccaaactct aaaacatcga atctcctctt tgaccctatc atgaaagtga 107640 
gttacttaaa ttaaagagta attgatttgc tgttaaatac attatttgcc aaataatttg 107700 
actgtataat cccagtgtag ttataattct gtgtggctaa ccctgtgttg ctctggaatt 107760 
ctgtcctctg attttatgaa ttctaagatg ctgatttaat ttaatttttt aaaatatctt 107820 
ccagtttact tgagtccaag taggctgatc cacagaatca tattcaaaga attccaatga 107880 
gctgttgagg aggattttaa gccctccact agcttcctgg catcagttgg ctttgagaac 107940 
caacctaagt atttcagagc tgctagtgaa atgaatcttt cttacagata ttacagaggg 108000 
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•ttacagaaag ttcagtttct gactcaccct gattttgaga aactgtttct tagggtcatt 108060 
ccctaaggtt ggaatcgctc tctctctctc tgttggttag tgccctgtaa acactgacaa 108120 
ttttctttct cttcagttct tcattcttgg tctagtgctt cagtttccaa gtctacatga 108180 
ctatgctaca gtgtaaggtt atttagacgt acaatttcat gatttctttc ttctgggatt 108240 
atttgcttga cttgtgatta gcttgcctga tcacagacaa tagtaacttt ttaaccattc 108300 
ttacatgatt tcccttttcc ttaaatttgt tgtttctgta ttagaagcag gaaaaaaggg 108360 
ccaacgttta gaatttgaaa agcagtgttt gaatgcactt gttacctggt aaaccagtca 108420 
tttactgcta tcttacaaat gaccacagca catggtagac ttaaagaaca attatttgca 108480 
catttctcaa ttttgtggat cagaaactgg cagcactcag ctgagagact cttctgtttt 108540 
acggggattc acagcgttca ctcagagaca gatagctggt gggcaggccg gtgtcaggga 108600 
ccaaggatgg catcactcgt gtgtctggcc cttgacaggg atgctggagg tctgggctca 108660 
cctgggaatg gagatggcaa cacctacatg tgacctttcc agcatggcag tctcagagtg 108720 
gatatggcaa cagctgcaca tgacctctcc agcatggcag tctcagagtg gatatggcaa 108780 
cagctgcaca tgacctctcc ggcatggcag tctcagantg gagatggcaa cacctacatg 108840 
tgacctctcc agcatggcag tctcagagtg gatatggcaa cacctacaca cgacctctcc 108900 
ggcatggcag tctcagagtg gatatggcaa cagctgcaca tgacctctcc ggcatggcag 108960 
tctcagagtg gatatggcaa cagctacaca tgacctcacc agcatggcag tctcaggtta 109020 
ctggtacttc ttaggtctca gctgaggact ccagtgcagg tgttgcaaga cagcagacgt 109080 
agcactgctt atgtgtatcc tgaagtccca tgtgtatcct gaagtcccag aaagctgctt 109140 
caagtagatt ccagtggaga agggagtcac caaggctgga cagtcattta tccatggcag 109200 
gactagcaaa aaaatctgca ccatctctca tcgcacacac gtgggtaagg tttggactgg 109260 
tatgatcttt ttccataatc tcattttcgt gatttcttaa ctgagaatat aatagtaaat 109320 
accaatctca cagaattgtt atgaggaatg cagaaactta atcaaattac ttaattgtta 109380 
ttttcatttc ttattaattt gttcccattg aaattaatgt tatgtctggt ctcattttaa 109440 
attatttatc tttctgaatt tggatcccat cctcaagttt tcaatgcttg actttttatg 109500 
aattcttgta ttatacctca aatttaataa aatcctttta atcttgtctt caccctagca 109560 
ctgtctatac tgtcccctgg gtggaagttt cccatcattc taattacttt tatacacaaa 109620 
tttgacttct ctctccagcc cctgtgagaa acatgtcctt tctgttacac agggagtagg 109680 
tgaaggcctt tggacttcct atttttccta tatgaaactc tcttctttca gatatcagaa 109740 
ttttcctctc tctctttcat tcagaactct gctagacaat catcttacca ggaagatttt 109800 
tattcatcac cctgtgctaa ataacattgg tactatgctg ggctgagcta ctatttttcc 109860 
ttgaatcctg ctgtaggctt ttcttcagac acttaccatc accaggcgta atgtgaatgt 109920 
gtttgttcat tgcatgtctt ctccacgtga atggaaactt catgacaaga atgtgttcca 109980 
gggctgggcg ccgcggctca cgtctggaat cccagcacat tgggaggccg aggcgggcgg 110040 
atcacaaggt caggagctca agaccagcct gaccaatgta gtgaaacccc gtctctacta 110100 
aaaatacaaa aattagcctg gcttctgtaa tcccagctac tcggaggctg aggcaggaga 110160 
attgcttgaa ccctagaggt ggaggttgca gtgagccgag atcactcaag aaaaaaaaaa 110220 
agaaaagaat gttttccttc acctctgaat ccctagtgta taggggagag cctggaacag 110280 
agtaggggtt ccataagcca atcaaataaa tgactgaaaa agagttaagg gacaaggtca 110340 
aacaaaaaga gctaatttaa ctgctaaacg attattctat ttaggccaat tggtaaccca 110400 
tgactgcttg taaaattcaa tttcattgaa tgggacatat ttcaaacata aacataggtt 110460 
tttttaatga atataactga actgcagatt gcaatcaaga tgttccctcc ctttctcatt 110520 
tagatttctc atcttattag ccatcactga ccctcctcta gtggttaaat gcttagtttg 110580 
ggacattatc tcagccatac cctcagaatg gttgattcag tggttcgttg cccatgtgtg 110640 
ctgtgatgtc aacaggctgt tgaatgctga gggcaatctc atatttggct aatgatggtt 110700 
cttcagaaag caacaccctt cagattagtt cagcagggtt caactttgtg gtgaccacag 110760 
ttgggctgag caggctgcat gtcaggcaga gctgtcagtc tgctccagcc aagagtggag 110820 
tgggcacttc ttgggatttg gataaggaaa catttgtctg agcagtttta ccaccttaat 110880 
atgggtttga ttcttctatt tgtatacaga gtaagccaaa ttaaaaaatg aaaagaaaaa 110940 
agaaccagtt catttaatca agacatgcta gtggttagtc atcttttata acaaccacat 111000 
cgataattca cctttttcat tttcaaatta gtttttggtt gttttgtaga cagagagcct 111060 
tagccataaa ttaaaagtaa tgataattag ccacatgttc ttcattatgt tgagggcttt 111120 
gaacatttct tcacatagct attagacatg tcagccacat gtaggaaaag gaactataga 111180 
atttcgtaat tttttttttt tttggaaaaa tgagcaacag aattttaagg cagaaacgtg 111240 
aaatgcgagt gatggcaagt cccagtgaag tggagaatag catgctttat gtgaatgtct 111300 
gatatttgtt ttctcagtgc ttttgatctc agaacaccta acaggactgg ggcagaagct 111360 
ttttgctgac ttgttgcatc agtgctaaag atgaaaggaa acagtcttca tattcgccgt 111420 
ggtgttgatt gccctataaa taggaaacta cccagttaac cttgcctgtt gtcatacgga 111480 
gagttttatt aaaaaataaa cagtgcgtgg cgggtagcag aatgtgggat ttctgcttag 111540 
atgcccgcaa ttctttcata tacttcatta aaatgtgcag gagaacttca gagccatgaa 111600 
taaaaatatt gttatggaaa ctgaggcctg ttatcatagc ttcagtgatt tgcaatgatt 111660 
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atttaatgaa tgagtttatg tttgaaaata tcccctgtga taatatgaat ttatcgctcc 111720 
cccctacccc tccgatttct tttattttga cacatatgga agatgctgtt ttgttttgac 111780 
actcagagga gatgctcttt ttatcaaaaa tcactgaaca gtaatgagat tttagtgaac 111840 
aatcttacca gacttcccga acataaagaa atctgatact gaggagtgca cagtgttcta 111900 
taataagtaa tcttaggtgt gtgcatgctt ttcagtggca gtattcattt ttcacgagtt 111960 
ttgtatttga tcttaccagg atttgatgta agaagacttg ggtagctctt gtgtagacac 112020 
aaagaacaga gaggacatcc ataaagccat agagacagga tcccaaccac agtgtgagga 112080 
gaaactgatg actatgccag gcatacagca ccatctcttc agcctaagag cagggggttt 112140 
accaagctca ccaaaacaac ttaccattcc tctttccgcg tgattggaac tcatcttcct 112200 
cctcagaggg tgcagagatg aatttagaag gaagtttgaa gagcccatgg tgtccagaac 112260 
tcaggcctgc ttgcaaatga gtggctctgg gactgtcttt ccacatctcc aggagcctat 112320 
tttccgggag gaaaatgaga gcactgttca gaagtactgg aaggaacctg ccaacattga 112380 
gtttcttgag agtctctgaa gttagttctt ctaattttta gtaccctgac aatttcatgc 112440 
aagttagtaa cttttgcata cttcagtttt ctcatctgta aaattggggt aatgataata 112500 
tctaccaccc tcacagacac acccaggatt aatactttgc atccttcaat ccaatcaagt 112560 
tgatactcag tattaactat cacatggtcc ctgaaaattt tattttattc tgatttggga 112620 
ataaaaaagc tttctctttg accccgaaag tttttctctt tttgttaaaa tatattaaaa 112680 
tatttatatg gatttctaaa atattgtgag ccccaggaac tctggttaat gtgcctgatg 112740 
atctctgttg ccgtcagggt tggtttctca ttagggcact tctgggcttg tagagggcca 112800 
ctttctcacc gtgtcccaga atggaccttc caagatctct ggtctccttt cctcttaaaa 112860 
ggacaccagt cctatcagat tagagtccct ctcttatgac ttcatttagc cttaattatc 112920 
gccttaaagg ccctatctcc aattgccttt ggggctagga aatcaacata tgaatcttag 112980 
agggacacaa ttaactcaat aacagtgaca ttttaaaaac tacaaaataa gggaaatatc 113040 
atagtttttt tctttacttt cacatagata ttacatcata ttaaaagata gatcatttct 113100 
tcattttaat tgaaaccaga attatattgg aataaggggt gaccaatatg ggtttttcaa 113160 
catttactct tatattaatg tattagcaga agacaagagt tcgatatttc ttaataattt 113220 
tcctagaaat tgactaatat ttttattttt actttctcag atatattcgt tttaataaac 113280 
tatttttgtt gttatctaat agtattcaat ttagacaatt tggagaacat ggatgcctat 113340 
aaacatacat aagaatacaa atgtatcctg ctttcagcat cagagataat ttagggttat 113400 
tgcaattgtt tctgttttgg caaattggaa ggattatggt tgatgaactg aggtaagtat 113460 
tctaggaaga atatttgctg ttgggtcagg catgctttca gtttttatgt atgtagagtt 113520 
tgatagacca ctgagacatt taaaagatgt tcagaaggta ttgaactttt ggaaaggcag 113580 
atcaaaatcc cacaagggaa gcctgacctg tgtacagaca tttgtgaatt gtctgcatat 113640 
aggtggtatt gaagccatgg atgaaattgc taactaaagc atactcttgt gagaaaagaa 113700 
gaggacaact tacccatcgg gcacattaac cacagttcct ggggctcaca atattttaga 113760 
aatccatata aatattttaa tatattttaa caaaaagaaa aagaaaaact ttcagggtca 113820 
aagagaaagc ttttttattc tcaaatcaaa ataaaattaa attttcagaa accatgtgat 113880 
agttaatact gagtgtcaac ttgattggat tgaaggatgc aaagtattga tcctgactgt 113940 
gtctgtgagg gtgtagccaa aggagattga cattgagtca gtggactcag aaaggcagac 114000 
ccacccttaa tctggctggg caccatctaa tcagctgtca gcatggccag gatattaagc 114060 
aggcagaaca acgtgaaaag cctagactgg cctagcctcc cagcctacat ctctctcatg 114120 
ctgaaggctt cctgccctca aacatcggac tccaagttca tcatctttgg gacttggact 114180 
ggctctcctt tctcctcaac ttgcagatgc cctattgtgg gaccttggga tcgtgtgagt 114240 
tcatacttaa atacttaata aactcctata tacatatata tatatatata tatatacata 114300 
tatatatata tatatatcta ccatattcgt tctgtcactc tagagaactc taatacatac 114360 
aataaaaatc tattaaattt ttgtctgaaa cagaaaaaat aaaatattta acctttaaaa 114420 
atgttcaatg ttgatatcaa tatgatgcca gagcccaatg aaaacaaaag tacttagttc 114480 
ccacaaaagt tataatgaga ccctgatttg acctagtttg gaaatagcaa cctttaaaag 114540 
gtttgagaaa aataaatacc tacacaagtg tttgaaaata tcccaggtgt ctaacaactc 114600 
actttctcta tgtctttccc ataccccaaa tatcaaccat gacatttttc cgctgatctt 114660 
tcagatctca gaaggcgaca tgaaggagat ccctctgctg tgagattgct attttcctat 114720 
tgtacctcaa ttacatttat aaacttcttt ctttcatgcc agcctaagtg gtttttgaat 114780 
ttattaactt taatgtttaa taaaccagtt tctatagcaa caatcctatg acagatgttt 114840 
tgttaaagtc tctttaaagc acacacctct gtcaatatgt tattttgaac atttcaaagg 114900 
ataaactctt gaaaaaggct gcacatatct caactttcaa gctcacaatt tgcattaata 114960 
attataccct ctcttaataa aagaattatt taatagaagg ctgatgagct aattttaaga 115020 
ataatacagt aaaaggtcaa aataaaactt accactctcg agaagaaagt gtaaggtcaa 115080 
gaaataaaaa aaaattacaa catggactat actactttcc tcaatgttaa ttactttaat 115140 
aaatgtgtga cgaatgatta ccaacatttt aaaatctatg tgcatttttc tccaatgagt 115200 
ggcactggtt ttctttgcac ttttagagtg ctcagtcctg gcatatgtag gtcactgacc 115260 
gggaaattac catctcaatt cactttcttt cttctttgct atttcttatc tgatctcgaa 115320 
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^^!^ Cat ^aaactatt gcaaaaccct cattattttt tccoaaatta aaatggtaat 115380 

tt tttta^t TtTtTS r t9Cat9Ct "tggctctg gtttcctttt ttt.tjtgtt l£ 40 

tttttttatt gttgttgtta tgaaggggtt ttttttattg cttttttgtt aaagaacgta 115500 

gcaaaaaaaa tgaagtatac taaaagctat ttcacccaga gaaacataat ttg?tgggtt 11556? 

tttaaaaata tcattttcat ctgcttcttt gggaaacatc agaaataggc ac? g agggat 115620 

ggcaaggttt atgctgaact ctaagtaatg catgttgtca cttgcagcca gtcac?gaac 115680 

tttctctgcc gtagcttcct catttttaat gtggactaag actccacttt tatatcacca 115740 

cgtccaagtt tagacacgtg accctaagtt gctctatcag ggatctctag gccttcccta 115800 

tlttlT.t ^ gaCt r ag tcaaa *> caa gtgaggtttt tttttgtttt 115860 
tttgtttgtt tgttcatttt ttgagatgga gtctcgctct ttcaccaagg ctggagtaca 115920 
gtggcgtgat ctcagctcac tgcaaacttc gcctctgggt tcaagtgatt cccctgcctc 115980 
ggcttcctga gtagctggga ttataggtgc atgacatcat gcactgctaa tttttgtatt 116040 
tttagtagag acggggtttc accatgttgg tcaggctggt cttgaactcc tgaccttgcg 116100 
atccacccgc cttggcctcc caaagtgctg ggattacagg tgtgagtcac cgcgcccggc 116160 
tgcaagtgag tttttaatga atgcatggac ttgaggacag aagaggagcc cgaggtt?ct 116220 
agtgggggct atgcacagtt ctgtgggtta aaggttgttc ttcctctttc cctgagctgg 116280 
ggagctaggg caagctatgt cttatgtctc ccaagagtca aaacctttgg ctcctLtaa 116340 
rrl^ttt agctcatact aa ^ctggat gaatacattt ccacaacaga aactcaaaat 116400 
ccatctgttt aaggatgcat gacaaatttc atttagaaca acagtatgag agtcatattc 116460 
aaatgtgaag caagagtgag cacaaggctc ttatttgttc cttaagaaaa aaaaatttat 116520 
£t™S)£ ^^ aa ^ C f tcaaccat 9 c ttcaggaaga aaaatgtagc ctggcctcct 116580 
cctac^ "SjSftgt ggaaaagtgg acctttaaga tttcactgaa agattacaga 116640 
tltltl "ttttacat tccaaacctt tgactggtga taaaatgaac aaaagaattt 116700 

aaa ™T tggaaaaaaa tggatccctt ttcatccgtg agggccctca ggctatatac 116760 
aaacacaata cacaaaacct gcatcattta cctgggacaa tgttatgggc tactaagtcg 116820 
atccttttag aagagcagtg agttaatgga ttgttttcct gacaatattt gtagtacagt 116880 
IttTrltt* f ggggaaaa tatttttatc tggcttttaa atgttatgtt ttatggttgg 116940 
gttaatgttg tggctgatca gctgttacat ggtgcatgaa tattgactgg attctccatt 117000 
agggctgatg gggcatcacc gaccagctat gtgatctatg ttctcccact ggctcttgct 117060 
tggcatgaga gatacatttg caggatgagg agcaggaatg agagctatcg attgtagaaa 117120 
cagaatatag tagagctgat tgacctgttt caaaattaag aaatctgttg ccatatcttg 117180 
cattacactc ttttatttta tattcttact ctgaattcct tcatgaaatg cacaaactcc 11724? 
a "^ ata tacatatata tatatgtttg catttcctag cttttaaaaa tagtttaaag 117300 
tataccttct ccaaagggga aatactttga aatatttttg gaagagaaat tgtattagtt 117360 
tagatccata ataaataatt atattgtttt aaaagctggt gcaaagtaac attttcttat 117420 
attgtagaac tctaagcctt ttggcttaga aaatgagagc aatattcttc tgcttttcat 117480 
cacttacagg gatattcttg gtattattat ataaatttta ttttcattaa aLtaccata 117540 
aaattccaaa tgtgtgtttc gaattactgg cagaaaaatg ttggtaattt tttccagtgt 117600 
gactcaactg tcattataat taagtaattg tctccaccaa aatgtttgtt taggcaatac 117660 
atttctgcat atttgaaaac ctaataaaaa gttttattaa agctgttcac tcaaaaatag 117720 
acaactgtgc cccgctgagc agtagtttaa aaccaaatca atgaatttgc tattatgtag 117780 
acaagaaagt gtcctatcat tgacattagt aatgtttcca ccaagttgaa cgtgatgagg 117840 
gagctgttag ttgttaccct ggaaactatt caaaaggtct agagctgctg aaaaacagaa 117900 
gactctgggc tcacccctgg gattccgctg cctttcagac catctaggat ccaaattttg 117960 
lltllTrll f\ tatgtct g aa «tgctt tactgccatt ttagactgat tgtatctaat 118020 
tttaagttta tgatgccacc tctctttttc cctatatttc taaactttct ggaagacgtt 118080 
ctgaaatcat tcaagtttat tctcattttt tatatctcag gactagtcac tcaatct?ta 118140 
aataaccact tttgatcaag agattactgt acccttctga tacaacttaa gatcataaaa 118200 
acaagttgat ccaattcaat agctttttct tatattataa ttcatgccct ggccgaataa 118260 
ttccaacttt ttcttttttt gaaatatatg ttaggttttc ctggttctca a?caatctac 118320 
gtgagcactc attctttctg tattctggtt cctactacat cttttatcag tgtctctttt 118380 
tgcattcatt gatcagtatc tcttttttta aatgcagaaa aaatagagtg tgcttataaa 118440 
tgtttaacta ttagccagct atttaacttc ctaacactat ctgatatgct tgttattaaa 118500 
D C ^ f tttctatgcc tttgattcgc cttatacact actaaggaca gaagagcaa* llllso 
ggagtcgatc taaaaccaga gaaccagaat cccttactcc cacagagaga ttgcctgtgg 118620 
gctttaggca tcggtttact tgtctctaga gctcaagttc acaagcacca gtagccccat 118680 
gggtagaatg tgaacctctg ttctatacag catgttgatg aatgcattac ttctgectta 118740 
gtcatgtcgg agcacagtaa atgtcaacta gcagtgcagc accggaacac gacatgcaat 118800 
ctgctcacct gagtcatgct gctttcagaa gagaaggcac tggtttaaat ctgcatggca 118860 
agcctgcccg agctagaatt cgatgagaac cctcaggtta atagctttgc tctttgagag 118920 
agctctaagc tcttttcatt ccagaccaat ttcacattta catactacct gtcactaaag 118980 
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ccccaaggtc cttcgccaac aagaagcaga 
ctttctagcc tttggacaca gtttttaggc 
gccatttgtg ttcctaaaat agcaaggggc 
tttttttttt gttgttgttg ttgtttgttt 
tgtggtccag actggagtgc aatggtgcaa 
gttcaagtga ttctcctgcc tcaccctcct 
atgcccggcc agtttttgta tttttagtag 
gtctcgaact cctgacctca ggtgatccat 
aggtgtgagc caccacgccc ggccctagaa 
attccgttta aaatgtcact ataaatctat 
tattccactt aaaagaaaaa caaatcagag 
gttagggtac acagcccact ttgggaccat 
aatgtgattt cttttattat tattattatt 
agaatatgca ggtttgttac ataggtatac 
tcctgtcatc tacattaagt atttctccta 
cgacaggccc cagtgtgtga tgttcccctc 
cccacctgtg agtgagaaca tgtggtgttt 
aatgatggtt tccaacccgg atgaagcttg 
gtgccgtcca tgagtgagtt aggatgtttt 
tcttccttaa gcataaaaag atactttatt 
tcctcaagag tggcttatct aaggcctcaa 
attatatgta ttgattaagc ttcatctatg 
tcagaaagag ggtgacactt cctgatggac 
agccccgagt tagactaact tgtgaaagag 
gacttggaaa gtgctgattt tgaagcccca 
tgactggccc cattttgctc ctaagcccac 
catgtaggcc aagctggcta cgggtggaat 
ttgtaataat tccttcccaa aactaaccac 
aagaacatta ggttaaagat tataagagca 
cagtccactc cattggaccc tcattgctgc 
gatttcaact ccttccctct ttcctcttcc 
ggatggttct ttaggacact aatcttccat 
gtcagcttcc ttgctgcaac atcttgtctc 
tatgggcttt gaactcagct acatgtctcc 
ccttctgttc acacctgtct ccatctcccg 
cagaggagat ggtgcgttcc ctcaagccat 
gatcctgggg ctaacaaaac atcaaagcct 
atacttctag ttccgttagc ttccagccaa 
aggagcacat ttttggagca cttagtatag 
ggcagacatt tatcatagca gttgtgtaag 
gaaacatata ccaaaaaatt ttaagtaata 
aagcaggagc ttagaaacaa gcttgtctga 
cagaatagag aatgttgctt gaaaacatag 
cattcttctt tgaattctcc acctagattt 
acacttaaaa gtattgttct tttaaggaca 
tcaaatttct tccttagtcc tacaatttct 
ggagtttact taagtcaatc aaccttcact 
tttcctttca tttagcttag ctcaaagtct 
ggagtctcgc tctgtcaccc aggctggagt 
tccgcctccg gggttcacgc cgttctccta 
gactacaggc gaccaccacc acgcctggct 
ttcaccttgt tagccaggat ggcctcgatc 
tccctaagca ggggtgtctt tatttattta 
tgagaaagag tcttgctctg tcgcccaggc 
gcaacctcca cctcccaggt tcaagcgatt 
ttacaggtgc ccgccaccat gtctggctaa 
atcatgctgg acatgctggc ctcacgacct 
ctcccaaagt gctgggatta caggcgtgag 
ttaatgccgc atcttcagtg cctaccatag 
ctcttccact tatccctggg ttaggcccaa 
gatcccagaa aaatctgttt tctctgcacc 



tatttactca ctaaagttaa tgtcccagcg 119040 
aacaaggaaa ataatacata tgtataaatg 119100 
tataacccaa agcctagaat tctttttcgt 119160 
gttttgtttt gttttgaaac agtcttactt 119220 
cctaggctca ctgcaaactc agcctccctg 119280 
gggaacctgc aattacaggc gccagtcacc 119340 
ggatggggtt tcaccctgtc gttcaagttg 119400 
cgtcttggcc tcccaaagtg ctaggattac 119460 
ttctttttct taaagacaat tataatggta 119520 
ttcatttatt actgcatgat tttaatctat 119580 
aaatcagcaa gggggcaata acaaatgtgg 119640 
attcaatgaa gaataagatg agagaaagat 119700 
attatacttt aatttctggg atacgtgtgc 119760 
acgtgccgtg gtggtttgct gcacccatca 119820 
atgctatcct tgccctagtc ccccaccccc 119880 
cctgtgtcct tgtgttctca ttgttcaact 119940 
ggttttctgt tcctgtgtta gtttgctgag 120000 
atttattaag accaatagta attacagagt 120060 
ctgtgacatg tgagagaaag cccaattcaa 120120 
ggcatgtatg agtaacagtt caggagtaga 120180 
agatcataat cagagtctaa tttaactcat 120240 
taagctctat acttaagctc cagttcctac 120300 
ataactgaaa ccccagaatg gggtcacaaa 120360 
cccaccttag aaccaatcat gatgcccaaa 120420 
tcatcgaagc agtgaaaagg aggcccggca 120480 
ctttaggtaa acggcttttg cttatctctg 120540 
ttggtttata gtttcacttt aaagcaaggt 120600 
ggaggaaaca agaagggcgt acacataatt 120660 
ttgtgacctg accaaggatg aagaacttca 120720 
ccagatatat gtggtcattg gttgcctcgt 120780 
ccaaacttaa atgttcctaa aattctatta 120840 
cttctcagtt tgcttgtcct ccaaaataaa 120900 
ttgacttatt tgctgttgtg tggcaagcag 120960 
aagtctgtga aaatgatgtc ctcttagagt 121020 
gggctggaga tgtagtcagc ttgacctatg 121080 
atgtgaattt cctgttgcca taagaagatt 121140 
attatagata atatttaatt aaatctatgg 121200 
ttgtcaatgc cacttcataa aaattaagac 121260 
actagatctt atgttaaatt atttagacta 121320 
taattataat tgtgtcttct tagagctgag 121380 
ttgacaaacc cacatagcca gtgagaggag 121440 
ctccagagac tgcagctttg cttataaaga 121500 
gttcaaatat atgtttcacc acttagtagc 121560 
gggatatttt cttctatagc taatgcctaa 121620 
ggcaccttca gagcctgtca gaagttgcga 121680 
attagcctct ttttctatgt cagttactct 121740 
tcaaaacttc ttccctggaa tctttttcct 121800 
atgcagtctt ttttttcttt tttttgagac 121860 
gcagtggcgc gatctcggct cactgcaagc 121920 
ttctcctgcc tcaacctccc gagaagctgg 121980 
aattttttgt atttttagta gagacggggt 122040 
ttctgacctt ctgatccgcc cgcctcagcc 122100 
tttatttgtt cattttattt tattgttttt 122160 
tggagtgcag tggcgtgatc tcagctcact 122220 
ctcccacttc agcctcccga gtagctggga 122280 
tttttgtatt tttagtagag atgacctttc 122340 
cctgacctca ggtgatctgc ccacctcggc 122400 
ccgctgcgtc cggccaaagc agggatgtct 122460 
agcccggcat gcaatagaag tgtgcgctaa 122520 
ctcctccagc tatagtagta gagtcaagga 122580 
caggctttta agtcatcatg agcaagactc 122640 
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tgccttttct tagtttttcc aggtggtgaa gatgtggaac aggaaaaacc acataaaaat 122700 
gcaatcacat gaataaatat tgcctacaat ggctggattc aactgaagaa cttacactct 122760 
aaatcaggag cacagactca tgtttatgcc tttgtcccag gcaccaactt ctcttgaact 122820 
tctttgctga ccctgcctga aagtggagtt aactaaatct tgtgtagatc tgtctcatat 122880 
cactatttag cccatttgca gataggtatg atattgttaa aatgatgaaa atgccccacg 122940 
ttggactcag ttggatattc tgctccggca gatcaggttc taactggaat cttggttcac 123000 
ttcacttttg taggctaaga tattctattc gtacttgact gagcattgta ttgatgatcc 123060 
ctgataatgc agctcttcaa atgtaataaa tttcttttgt aattgactgg ccatgcccac 123120 
tcaagagata tatgtttgtt gcctgattga gtgtctaagt aaagaattaa atgaataaat 123180 
gaaagagtaa cataagaaat aaacaacact gggacaatga actgagactt catttattaa 123240 
gaagcaatta ttgataattt tgtatatgat tttagactat aactaatata ctccaaatgc 123300 
aatagtagac aagatagaaa tggtcccttc cttcatactc gtggttgaaa tacattgtat 123360 
tgctgatgtt ccatagcttt tttacacatg ttgttttcat ttgtttcaac taacggtaag 123420 
gagtgtggat atttgaatcg gccagtgcaa aatgcagtga cttggaaaga gggaatcact 123480 
gttagtgaat cataaacaat atttgaaggt ccttcagtgg tttattaact tcaaataata 123540 
taactagatg agtctcggtt ttacaagtaa ccaggaagta atgcaagaat cctagcgagg 123600 
gtggtatcaa aaacataaag tttatgtata aaaagcattt gaactcaaga atttgacata 123660 
gaccttagtg aaaatgcagc tatttcttgt cagtttaaaa gaaataaata atttgttaaa 123720 
ttcaaagcaa gggatcccta atcaacttag ctcacaggta atttcaaata aacataatgc 123780 
ttattaacag ttccaatatt tgactttttg ctcaacctaa attcagcaat gcaatgtgct 123840 
gagatatatc ccagtaaaga gcagctgctt ttgttttgtt ttattttgtt ttaatttatt 123900 
tcacaaacaa gtttctgcat gagcctgaat ctttacaatg tgaaagctgt ttggattcag 123960 
gtaggaaata cgtttgaact gatgcatacc aaatttccaa aagttcacct ttttatctta 124020 
ctgcttagct gttgaataac acaacttttg cttagtcaag ccttctttca ttttcttgaa 124080 
gatgaaaatg taattcttga gttacttctg aagaacaggt tggggaaatg caaataagag 124140 
cattactctt ctagagtatg aaaagattac caaatttagg ttttctggtc agtaaaatta 124200 
tattttccta acttgtttac taattacaga gtcatttaat tagattatca tttcttcttt 124260 
ttccattgcc atctttttaa atgatgatat ttccagaact gctctattta actgttcagt 124320 
gaaattacat tcatgaagct gacaattatt cagaagtaag tttcctatgt aactctccca 124380 
ttcttcatat tctttgtttc taatttatat ttgggtctga aaattgataa agttactagt 124440 
ttagaaataa ttaactgtga cttcttattt tcagaaacat aaatgagggt gcacttctcc 124500 
gcagaacctt tttcacattc ttttaacaac caaatcattc ccaattttct ggtgtgtaaa 124560 
ttccatgtga tcagtacaaa taaacaaact gacaagattt gggcagttgg aacccattgt 124 620 
tttacaacgt ctttgtttaa aatctttcat aattcatctt cagcaatggc cttcctagca 124680 
attagttacc tttagtgtct tcattttcgt ttaaatgttc aatgatcagg ataattagaa 124740 
ttaatttctc attgtcattt tatggtaatg aagcttctgc cacttatcac aagcatcctg 124800 
caggcacagt gttaccacac aaatatgtga ctcctcactc ttcacacact ggtttatggt 124860 
ccctggttaa agtggtaaca gtaggacctg gcctgattcc agtttaaatg tatgccatat 124920 
tagatgaaag attgtgggta gtcatgctgg tattcttgcc ccttgtggat ggcattctat 124980 
gtgcaactca gttctcctta agcaacacaa gagaattttc gtctccatta gtctgggtgt 125040 
tttacagact gggattagtt ctacagttag aacaaaagga aatatctttg tgaatgaaga 125100 
tggaaggcaa tgtcaattca aaactttata aaaacttcag atagaattca ggaaagcaaa 125160 
ggttgaaaaa cttagtacca gaaaatctat gaagaagaaa gtgagtcagt gaattctcaa 125220 
gctggaagag ccttcttgca ttatctggtc taggattctc cagccacaaa aaaaaaaaaa 125280 
aagtaattgt gtattatcat ccccattcat tccctgtcat atctctgcct aagcaattgc 125340 
agtagaattc atcctatcat ttgtatcctc ttccttactc ctgggaaaca tttagatagg 125400 
gtaaaggatt tggttgagca gaaacctggg gtgggaaata gggagcatat ctaggtttgt 125460 
caaagaaggt atttaaggct gccctgtgaa atcataccag tttcaaaatt agaaacttga 125520 
tcttttttac agctgacttt tcttcaccac attttaaact tgttctttat tctgcatttt 125580 
ctcaagaagg gatatgatat taccaagaga aatacttgct ctcaataacc ccaacctcat 125640 
tttcttatga gtttgggtaa gttagaaaga aggtcaaact tagtagctta aagaaaaaaa 125700 
cacattttct tttgcttata attgttgagg aaggaaattg gaaagggcta ggcagggcag 125760 
aattggaagt tttcctgtgt gtctggcaga tgttggctag tgctgcaatt atgtggagtc 125820 
tatttaaact ggccatccta tggcttcttc atgtggctgg tggtgaatgg tggctgttgg 125880 
ctgggaactc agccggggat gacaagtgga tcacctccat gtgtctttgt agcacagcag 125940 
tttcaggctt ttagaatttc ttccagtggc tggcatatac ctgaaagaat gtcccaagag 126000 
agtctaaggc agaaatgtgt ggcttattta taactagttt tggaagttac aatgagacat 126060 
ttctgctgca gtttactgat cacaagagag ttactaagat tgatcgagtc tctacccatg 126120 
tgagggaggt gaatcaaaga acttgttgga caggttttag acctgtcata gagtgtaaaa 126180 
aagctaagag gcttcatatt caagggaaga aatactttaa agttctccag ggatgaggct 126240 
agtagtgatg attcagggca agaccagtga agtaagagag gctgggacat gcattgaaga 126300 
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agctcactga gaaagttctc tggtaatggg ctgcaatgtg ctggagttag tccacaatgg 126360 
cttgcaagat ccagttactc acttgttttc tcaattccag ctctgagttc agtgatgtca 126420 
ttgatagttt gaaaatgact acagtggaat atttacacca cattaattgg cagactctac 126480 
aaataagggt gttttttccc caggagaccc agttgctgaa tagttaccaa ttggtaatgg 126540 
ggcaacttag aaactacttt taggagaact ggactgtctc atagcaattc tatgttgtga 126600 
actttggggt taaaaccact ctgcttttgt aaccttttca gtaaaatttg cgaggttccc 126660 
tattgctttt gggtgaatga cattttgggg gataatctaa tgtgattctg aatttgagtc 126720 
aacactggct gtttcagaac agcacatgtt gaagcaggag cctgatgcag aatacgtcat 126780 
tagagggaaa gtgctggtaa ttagcagaaa ttacaggaag ctctgtactt caaagacttt 126840 
aaaattgagt gtgtgaacca agcacatttg agtttccaga gaaggaatga agctccagtt 126900 
gacactttgg ttatacaact gcatgataat ttatcattta cagtatactt ttatgtacct 126960 
cataatgttt aaatgggtta tagatcaaat gcaaatattt aaatattttg catgacattg 127020 
gacaagtcac ttaaattttc tgatactcag attcttcatc tctaggaatt agctgaaagt 127080 
atctttatag agagttgttt cagtgagtgc tgtaagatta aaaagaaaaa taagagacca 127140 
tcattagcca tgagtttagt aaatatttcc cagtggtgga acaccaattc caagtttatg 127200 
taagttggtg ttgatgggaa ataagcaatg attaaacaaa ggaaaatcag gagaaagatt 127260 
gttaacaaag ggcaatgctg tataagtgca aaaaaaaagg ggaaaaccac agttttttga 127320 
tttcacaagg aagcagaggg ctggctttca ttcatgcccc actccagcct cagttccatt 127380 
gcttcttgcc ggcaaagctt tctcgctctg acagaaggca agcaaacccc aagaggcccc 127440 
aaggtggcct gtcagagggg gtaaaatata tagttttgca attgtataat attaatgttc 127500 
ttatcagagt gagaatgtga gtagaaagca gagattttaa taatatgacc caaggaagtc 127560 
aattttgaac attttacagt attaacaaaa atgtttagaa ctttagaata tatttaactc 127620 
aataaaaaat aaatatattt tatgttatgt atgaataaca taaacaaatg tttggattag 127680 
ttcagagttg atattttaaa ataataaatg ttttgttcta gtccatttaa ttatgggtct 127740 
tgattggcaa ttctggaaag gactcagaaa gtaaaaaagg agagtaggga aagcctcaag 127800 
cttagatacc acctaagtca tcattaatag ctattggtag aaatatgggt ggtgaagacc 127860 
attctgatgt gttcccagat cctactgggg agaatgggat tggaaaatgg agaaaagtct 127920 
atctttgtta tgaagtgtca cataacttgg ctgagttgtg ttcctgtcct gatattttgg 127980 
ggaaggtaga acttgtgagt gatgaaattg tatatttggc tgcagcactt tctaaacaac 128040 
actttgaaag agtagcctcc ttcctctttg ctgctcctag taaaatggga gaagagaaaa 128100 
attattagag gatggaattg ttaatcaaaa ggaaagcaaa acttaaatat ctggaaaatt 128160 
cttagtctat ccatattgta aaaaatgagg aagtatcttt gcaagagaag accaagggtg 128220 
tgaccaaagg acaatctaac agaattactc agtcatttaa cagaatctag aacctgttgt 128280 
ccnagaccat ggaaggataa tcctaaaagc aattcaaaaa ctatcagaac tgccacccct 128340 
atcgcagacc caaaatgcaa aagtcggggc agcagggtgg tttccaattt aaagaaggga 128400 
ccaccacgac cccgtgctgc ttcacattgt ggatctacct cccttactct gttaccatgc 128460 
accttggtag ccccagattc agttctggtg aactctggca gggaattgtt aggggggcag 128520 
agccaccaca gtgaacttct aataggccaa tgcccagcag agccacaaga gtagggctgc 128580 
ccctgagacc tcagacctgg agagccacgg gcataaaatt cttggctgga agatccacag 128640 
gcacacatct ccaatccatg agagctgcat tgtaagttgc atctggcaaa gtcatgtagc 128700 
caggggttcc cagagccttg gaggtccaaa ctctgggggg caaagctaag ggagtaccac 128760 
ccagggtgct cagaagggac ttttccccaa agtgggttgg aagggcagag catcaaaccg 128820 
aagaagattg ttattgagac ttatgctata atgttttctc tgttgggctg tggacttagg 128880 
gcctctttcc cctttcctct ttcctgtttc tatgttttag aatggggatg catgttctgt 128940 
gcctgtccca tcattgtatt ttggaagcac ataacttgtt tgatttcata ggttcacagc 129000 
tagggaacaa tttgcctcaa aatgaatctt accttgagtc tcattcatac ctaacttaga 129060 
tgatatttag atgagatttt ggactttgaa ctattgagtt gatgttggac caagttaaga 129120 
cttttggggc tactgggatg gaacgaatgt ccttgcttgt gagaaagaca tgaattttga 129180 
gggctaggga cagaatgctg tggtctgaat gttcatgtcc cctcaaaata tgtataaaaa 129240 
tttgattttc caaagtgata gtattaagtg gtggggcttt tagaaggtga ttatggtgga 129300 
gccctcatga atgggattac tgcacttaca aaagaagccc aaggaaaacc cctttctcct 129360 
tccaacatgg gaagatgtag gtgaaatatg acatgctttt ctgtgaacct aggtaaggat 129420 
ccacaccaga ctccaaattt cctggtgcct tgatcgcaaa ccttttagcc tccagaacta 129480 
taagaaataa atttctgttt ttaatgaagc tgtgcaattt atgatacttg gttatagcag 129540 
ccagaagagt ccaagacgtg gtcctacaat gaagttagag aattttttta agtgtctgtt 129600 
tattgttaca aatggcaaac aaaaggtcat tatgcttaca cacacaataa atgtgtacaa 129660 
tgtgtatgag taagtttaaa acaacaattt tttttaatat gaaagtccca gttagtttta 129720 
aaagaaaatg aacaactgag agtaataaaa cctttgttct tgttttgaca gtgatgggga 129780 
agggttttca aaaataatga gttggcttca gtttggaatt agggaattac ctaccataaa 129840 
actgtctttt agatatccta aaatatcatc tgttttgaga tacacaacat ctaaaggcca 129900 
atgactttat tttcctgcac agagaaagga gacctaagaa ggattatttt aggtttggat 129960 
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tcagaaatgg ccattgtgta aagtatgtga gagcagattt gaatatttgc catagtgcag 130020 
aaataattgc ttatagtacc tgattttttt aagttattat ttcatccata atatatcttt 130080 
ttcttatatc tttaaaattt ttacaagata aaatgaaaat tgaactaact tagacaaagg 130140 
ttattttcaa attataaaag tatatagatt tcaaaatgtt tattgaatac ttatgcacaa 130200 
tggtcaaaac acttatattt ttatataata cactatgatt tttagcattc tgagcaaata 130260 
tgtgaaacct tttagatatt tgcattattg ttaacaaaga agttagtgaa acatgtcaca 130320 
tatatacagt ttttttatat tagcttgaat ttcttaaaat cagcctttac caggtatatt 130380 
agtccattct cactctgcta ataaagactt acctgagact gggtaattta taaagaaaag 130440 
aggtttaatt gactcacagt tcagcgtggc ttgggaggcc tcaggaaact tacatcatgg 130500 
tggaagggga aacaaacacg tcgttcttca catggcaaca ggaaggagaa gtgctgagca 130560 
aagaggggaa agccccttgt aataccatca gatcttatga gaactcattt gctatcatga 130620 
aaacagcagc atgagggtaa ccagccccat gattcaatta cttcccaccg ggtccctccc 130680 
acaacacatg gggataatag gaattacact caagatgaga tgtgggtgtg gacacgccaa 130740 
actatatcat ttcatccctg ccctctttca agtctcatgt cctcacattt caaaacgcaa 130800 
tcatgccctc ccatcagtcc ctcaaagtct taactcattc cagcattaac tacaaagtcc 130860 
aagtccaaag tctcatctga gacaaggcaa taccctttca gctatgggcc tgtaaaatca 130920 
aaagcgagtt agttacttac tagatacaat gagggtacag gcaatgggta aatacatcca 130980 
tatcaattgg gagaaattgg caaaaacaga gaggccacag gccccatgcc agtccaaaat 131040 
ccaatagggc agtcattaaa ccttaaagtt ccaaaatgat ctttgactcc acgtttcacg 131100 
tccaggtcat attggtgcaa gaggtgggct cccacagcct tgggcagctc cacccctgtg 131160 
gctttgcagg gtacagtccc cctccccgct gctttcatag gctggcattg agttcctgca 131220 
gcctttccag atgcacggtg caagctgttt ggtggatcta ccattctggg atctgaagga 131280 
gggtggccct cttctcaaag cttcaatagg cagtgcccca gtggggactc tgtgtggggg 131340 
ctctgacccc acatttcctt tccacactgc cctagcagag ttcttcatga gggctccatg 131400 
cctgtagcaa acttctccct ggatatccca gcatttccat acatcctctg aaacctaggc 131460 
agaggttccg aaacctccaa tcttgtcttc tgtgcatcca ctggcccaac accacgtgga 131520 
agctgccaag gcttggggct tgcactgtct gaagcaacag actgagctgt accttggccc 131580 
cgtttagtca cagctggcat tgagccagtt gggactcagg gcaccatgtc ttgaagctgc 131640 
acagagcaag ggggccctgg ttttggttca ataaggcatt ttttcatttt aggccttcag 131700 
gcttgtgatg ggagcggctg ccatgaaggt ctctgacatg tcctggagac attttcccca 131760 
ttttcttggt gattcacatt tggcttcttg ttacttatgc aaattcctgc agtgtgcttg 131820 
aatttctctc cagaaaatgg gtttttcttt tttattgcgt cttcaggctg caatttttta 131880 
aacttttatg ctctgattac tcttgaatgc cttgttgctt agaaatttat tctgccagat 131940 
accctaaatc atctctctca ggttcaaagt tccacagatc tctaggtcag gggcaaaaag 132000 
ctgccagtct ccttgctgaa gcatagcaag agtcaccttt gctccagttc ccaacaagtt 132060 
cctcatcttc atctgagatg accacttcac tgtccatatc actatcagca ttttggtcaa 132120 
agccattcaa caagtttcta ggaagttcca aattttccca catcttccta tcttctgagc 132180 
cctccaagtc tctagaaagt tccaaacttt tccaagtttt cttgtcttct tctgagccct 132240 
ccgaattgtt ccaacctgtt cttgttaccc agttccaaag tggtttccat atttttgggt 132300 
atccttatag ccccactccc ttggtactaa tttactgcat cagtttgttc tcacactgct 132360 
aataaagacg caccccaaac tgggtaattt ataaaggaaa gaggtttaat tgaatcacag 132420 
tcctgaggct gggaatgtct caggaagctt acattcatgg cagatgggga agcaaacacg 132480 
tccttgttca catggcagca agaaggagaa atactgaata aagagagaaa agccccttac 132540 
aaaaccatca gatctcacga ggactcactc actatcataa gaacagcctg aggaaccacc 132600 
cccatgattt aattacctgc taccgggttt ctcccttgac acgtggggat tattggaact 132660 
acaattcaag atgagatttg ggtggggaca tggccaaacc atatcaccaa gatataaaat 132720 
taagttgcct cactttaata acaaggttga gttacatttt aaaattatta ttaaagtata 132780 
tatgtgtgtg tgtgtgtgtg tgtgtgtata tatatataca tatatgtctt taaagttatg 132840 
ttaagtgttc catgtactac tgtattgagc acatagaggt gctcagtaaa tgtcaatgtt 132900 
aggtgaataa atgagcggat aaaatatgga ccaacttagc tggtcagtga gtttatggtt 132960 
acttatcttt gagaagcatc ttttgccttt ctccatattg agattgttta attgagctgt 133020 
gtaacttttg tatatgttat tatatgtgtt taaattttgt atatattttg tctaccttag 133080 
tagcaaattt tattgagaat atttttttct caaaatagca tccctttaaa ctagaaaaaa 133140 
aaacagtttt tatagctctc ttttgaacta aatactaaag aataaattgg tcaagttctt 133200 
tctctgatca tgaagtacct gcaaactgct tcccataaat ggcagaaaag gttaagagac 133260 
attaccagga tggctgcccc aaattgttgc tggtatcatt aaaatagtca acaaaaatgt 133320 
ggagcctaca gccttgtaaa aataggaaca tttaaatcca tactttagtc atttcctctc 133380 
cctgatttat gttttcttgg ttgtttgcag gcgcagtcaa cagcagtgga agagctggct 133440 
catgtgtggc tcaggctgac tccacaatca gagtgattat gtggaaatca atcagaattt 133500 
tcttcttaga agatgtttgt tctgacataa tgactgacat tccagctaat ttttattcat 133560 
aggcttctcc ttgcacaatg atctctattt ctctcaatgt tattctgtgt ctttcttgct 133620 
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tgcctgcaca ttctgaaaaa aaaaggaaag caattcccta cattttattt tgatacttaa 133680 
tgttgtttgc taacatattc agaacacaat gcaactttat atattagagc acttataata 133740 
tatattagga aaacatcaat taggtaccac cagttatttt agacaggtat ctcttaggta 133800 
aaatgtggtg aaaatagagg ggcaatgata gaattcatat gtgaaagagt caattggcca 133860 
tcattatgcc aaaattattt tgtggttcag gggacatttt aaactttaat ttttaagaat 133920 
taaactttaa ttaaatttat tgtagtttaa atgcaatcac aaggaaagac tctgaactgg 133980 
aaacaccaag ccaagctctt tcctagttct ggtcccctat aaaccctgaa agttaatgca 134040 
attattgtaa tttttaaacc accgagttct cagggtgatt agttatgcag caatagatag 134100 
cctgaaaagg accattcttt ctggcactga aaattaagaa gagtgattaa caacataaga 134160 
atgtagtttg tgctgtctgc cacagattta tgttgtagct tcagtgattg cttcaaagca 134220 
cccttactaa ttgccagcct attgctgata cctagctaaa ttagcatgtg ttagttttga 134280 
taccactata ggcatttaaa ggatagctga tacattctat ataaaatata attattttcc 134340 
tatctttagt ttatcctaaa gtagcagcta ttatattcgc ttcttgatta gctttataca 134400 
taaccagaga aagtaaaata tcacagagta tatttgaaga cataaaagtc tttggacaaa 134460 
gctcaagtaa tgaatcagga tgctattaca tcctccacat gacagtaaat atattttata 134520 
ttcctaaatt ttaagctttt aacaaaagca aattttatac tggactattt caatttctag 134580 
ggaaaggaaa ttatcattta ctgtctttga ggctataatc tatttagcca ttatttttat 134640 
attgatcaat ttataataaa ggtaagagaa aatactacca catatatgtt gtttatataa 134700 
ttttaacttt cataattaaa atactgggta tactgcaaga .tgtcaatatt tataatagct 134760 
tatatttgga aacaattata ctttatatac tcctaatcca ttgccttgtg tgtcattgaa 134820 
gaaattgaag ttcacaataa tgactgaaca aatgctcaaa gctggtctgt ggcagaacca 134880 
aaaatttcca tggcaccttt ctctttccat attggggtga tctgcctaat ttgactttct 134940 
gcttatatgt ctagactcta cccctcattc cttctgctag tgtgttgttg aaccctttct 135000 
attaacccac tctcgatttt tctgttgcac aatcaatctg tagcctgaat gttgagtccc 135060 
aaatgcatga gctacctggc cgagtaatct aactagatta gcctgagtat atcactttaa 135120 
ggccaatgta aagttatgtt gtaactacct catagttttg cagaggcaag aaatagaagt 135180 
taaggaagag aagaagccaa tcttgaatca gttagaattt ccttggacat agaaaatctc 135240 
tacaagcaaa aacccatttg attgaacaag caagcttatg cagttaaata tatcaaagat 135300 
cttgtcacaa aaagtgtaaa gggaaagtaa aataaggagg gagcatgaag aagccagctt 135360 
atattaaaga ttcaaagcag ggatataagg aaattcaagc caaaaataag gaagaattct 135420 
tataaatgcc ccaagaatga attcgatgtt tggtcgatat tccaacagat ttagtaatgc 135480 
tatttttaaa gtcaacaggc tgcccatttt tgcctcacac tctgactgca tgtaatctgt 135540 
caagtactct gatgagttct ggagatgcta atgcaaaaaa gatataaaat ccttaccttg 135600 
aaaatgtagt tcacaattta gccatggagt tatggcaagc atcttgtctt caattttagc 135660 
ccctcctcca tccctctccc agacacccat tgtcctcttt ctcaccacat ccaccccctt 135720 
ttttgtcctg tctttggtgc actatttata attcttctca tgtctaccac aactattttg 135780 
ggtagtaaga acatcagtat ggagacccct taatcctaac taatatcaac tctatctgtt 135840 
tggtattaat tcttgtttat tgaacttgag gcattgagta ctacaaaaat tataatacac 135900 
tactggtgaa cattacttta ttaatgatta ggttccactt ggatgtttgt cattcgtatg 135960 
tagctgttct ctccttgtag cctgtaaact ttctatggat gtcacaaaaa gagttttaag 136020 
gattttgctt aaattaaaat taatacatca agaaaatact atgcctcaaa caaaaaataa 136080 
tcagttaatt tagtttttag aaatctttta caaaaaacgt gatatttaat aatctttatg 136140 
caaggatttt tgggatatta aaattgtcac agatttaata atacgtttta ttgtataaaa 136200 
tttttctttt ctcttcaaag tgttgggaga gtgaattctc taagtaaaaa ctgaagagta 136260 
agatgttatc attaaacatt tcagaaaatg cacactatcg tagactgcgc actgtgaaaa 136320 
tgatcagcta gtagaggttt tggtgaacat atagtaactt ttatcaacaa gtaatgtgaa 136380 
tatgtgaaat ctattgcttt tccaaaagaa agaaactttt acacttcttg gtaaagataa 136440 
actttcatgt aagaaaaaat aaaaacataa ttttagcaat ccaaaccata ctaaaatcat 136500 
aaacacttga ccacatgttt gagtacccca aaaattgttc ctgtgtggat tcttcttcta 136560 
ggatgatcag tcctcatggc cttactactc tccatttctg tcccactgag gttagccaaa 136620 
gttcacttgc cttgtcagtt aagtttctaa atcagatgag ttatgttttc tataattgag 136680 
ttaaccacag ttaattgctt aatagactct gtctacacta actgaaaaaa aagaattaat 136740 
tgcttggaga aaatgcatta tatattttat tcacatctac agtcatccag gccaaagtga 136800 
caaaggcact ttcaacagtg gccagggcca atggcgacag cggggccagt gcttagctca 136860 
gccatgtgtt caatctagta ctttgggatg tggatataaa gaaaatgagt tataaatgaa 136920 
cttctcctta gagcttaatt cattattatt attattatta ttattatttc tttacataag 136980 
aaaataattt taggagtgct ttggcaaaag tttaggtaca atttaagaaa atgccatttg 137040 
taaaaaaaaa aaaaagacta gnnttttttt ttaattaagt ttttactttt aatgatttag 137100 
atttaggttt taaaaatgta tttcctagcc aaggtctaac accaggtatc tgagattttt 137160 
ctaaattttc tctgaaaaca tgagacgcac gtacagttat ttaaaatttg ctgcagccat 137220 
ttttaataga ttttttatgt ttgaagagat gccaatcgct tattatagta agctatgata 137280 
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gcaatgccaa tgaatcctca attccattgc ttaaatataa gtgaaataat aaaggaaaag 137340 
accagtattg aatatattgc ctcttcctgc tacactctaa catattcact acaataaatg 137400 
gaaagttgca catacaataa aaacttaggc aatttattta tttacttatt ttttatttat 137460 
ttttttgata tggagtctcg ctctgtcgcc caggctggag tgcagtggtg cgatctcggc 137520 
tcactgcaag ctccgcctcc cgggttcacg ccattctcct gcctcagcct ccctagaagc 137580 
tgggactaca ggcgcccgcc accacgcccg gctaattttt tgtatttcta gtagagacgg 137640 
ggtttcaccg tgttaaccag gatggtctcg atctcctgac ctcctgatcc acccacctcg 137700 
gcctcccaaa gtgctgggat tacaggcgtg agccactgca cccggccggt aatttattta 137760 
agtataattg ataatcaggg caatatcaga aaatcttgaa taatttgtac tctaaaatga 137820 
atttatgctt ttcagtgctg gaagcattat agaggttgtt tcaactatga aatcattttg 137880 
tcaaacgttc tttgaagtct atgaccttaa gtaaagcatt taaaacccaa tggtttagaa 137940 
acgtccagag aaacaagtat attggcaatt gtaaataatt tttgaagtat ttagcaaaac 138000 
agcgaagtac aataagcatc ctattcacaa ttccaatgta aaaagagatc tggtcatagg 138060 
cagaattata ctaagaaaaa ccctatgaag aaaataaagc ttgtttgtta cctccaagat 138120 
ttcaccatct ttttcccgtg tccctgcatc tatgccctac tgtttctgct tcctcattaa 138180 
tcttaaatcg acttaagtgg atactggttg tttaaatatt tcttatctgt atcacaagtt 138240 
caacaaggaa aatatcttga taataaaaaa agccaaagga agcttttaaa aatttaccca 138300 
tggattgtaa ttacccatga tccccatcct attcctttag aactgttaga taagagtttt 138360 
ctacacttga aactttgagt gtttctaaaa atatttcctg ctttgagctt accaagttga 138420 
tagataccgt ccagacaggg tgaactgaag caagagtccc tggctcgaaa gtgtgcctgg 138480 
catgttctaa taaaacaaaa aggccgcacg ggtgctccta aaagggaata aagtcagcaa 138540 
gtggggagag gggcatgtat gacttcttgt tgattattag aagaacattg agaaatgcaa 138600 
aaccactttg tggttttgtg aatccactgc tgttttcagg atacactgcg aggggtaatg 138660 
gcacaaaagg gaaatctcct ggttgatagt ttggtaatct agacagaagg tggtgttggc 138720 
tagaatcaag gtaatagcaa gtcgagttat cagaaatgtg gaaattggtt ttatattgag 138780 
aaacaagccg atggtgctgc tgagaattgc acgtgtagtg agggagaaca aagggactca 138840 
agagtatatt tttttaacca aagcagctag aaagatgaca ttgcctttgt acccttcatt 138900 
ctctaattgt attccccatc tctgccttca ttctctgatt acctttccca acaggtagct 138960 
ttagcttgtc cacacctaac tatgcttctg tctagggcct tgacacttcg cgattccttc 139020 
cacctggaat gcagttgccc aagatatcct catcaattat ctctcatatt tttcaagttt 139080 
ttatttgaat gtcacctttt caatacaact tgccttgccc cccatcttag attgtatccc 139140 
cctaaataca cttcctgtct accttctccg attcgttttt ctccatagcg cttatcccaa 139200 
tctaacaaac atattttact tattttattt attgcctatt tccccagtaa aaaggcagct 139260 
caataacaag gatgtctgct tgttttgttc actactctct ctccaaagcc tggaatagta 139320 
tttggcatat aatagcacaa taaatatggg ttagatgcat gaattaatgc agggaattag 139380 
ccaagctatt tactgtactc tggtctctga gaaaatttgc aagcaggaga aagattgctc 139440 
tatggaaaaa cgtatgaatc aaagcaaaaa actccatcat ggtgtatcca gatgaaagtt 139500 
gtttattgta atttagtttg tgataaagtc ttcatcataa ggaggatgaa aacaaaagcc 139560 
ctagtagtat cagatcctca gagaagagct aataagctga gctccttaac atgactcatg 139620 
ctttctctaa cttgggggaa tggttggccc caactctcac tgagtctgca cactgaggaa 139680 
aactcctcaa aatctagcag cctggaagga tgaatgcaaa agaacagtaa caccaatatt 139740 
ccctctaccc agctgacatt tctctcccat ccttcatttc cttaccataa ccctctgtgc 139800 
tagcccattg ttcatggata atgtccaaga atgtacaatg atatacaaga tcatgataat 139860 
tgaattccaa cttctgtcat ttttcttcgt gcccccccat atccagtgat tctgaactgc 139920 
tacagttctg taaaccaaac atcccttttc cccaatttta taactaaacg cttttacttt 139980 
tttattcttc ctagaatagc ctttcttatc tcatatcttt gtaaatttta attcatcctt 140040 
taataatcat ttatagcagt tcttcattag tgaaattcct gatacatttc ccaccctctc 140100 
tgaaaaatta attgctccct tattgatttc ccattgattc taatagttat gaaatatttt 140160 
attacaatta ctggttcaca catctgaatt ctgcttggag ctgtgaagaa tctgaaggca 140220 
agaaacatat tctactttct ttgtataaaa tggacttatc acataataat ggctcattag 140280 
ggtgaatgaa tgattaatat taacaaagaa aattgtcaaa attaggtaag cttcttttaa 140340 
aaagttgaag ttaaacatta gccatatgtt ttctggaaaa ctgagcactt tctgatgtgt 140400 
ttgaaatcac attgagaaag tcgtctcttt tgctactttt gtttactgga atgaagggta 140460 
actggaaaga tttcccagag aaaacaagac cacaaatgtt ttgaagtagg gaaaaatatt 140520 
ttaattgaaa tagaggtggt acaaagacaa gaaatatgac atgtgataca gtagatataa 140580 
gcgtatatga aatatatggg atgaacatcc atacaggttt cacaggactg aaaggtgcaa 140640 
ctggtacaga gagtctggtt gttgaagtat ttaaaactca caatcaaagc tctgatagta 140700 
ggatgcaatg tcaatgatta ctaatcgaga caaatttgct caagatcatt gtataattac 140760 
acccaaattc caatttgtga atagatttac tcatcgtgac ctagctgata acatggaaag 140820 
ataaacaagg cagatacaca gaacattttg cagactatgg tggctgcaag aatagctttt 140880 
cgcttgattc aggctttaag tgttagcctg aacactcaga acgcgattta ttgttggaaa 140940 
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gggattaacg aagacattta tttcagaggc tactttgttg ttgttgacat ttttgttgtt 141000 
■gttgactttt ttgttgttgt tcttccctct catctgggat atttcccttg cccttctgac 141060 
ggcttgtcaa tggggaggtg ttcgttcctg ctttatctat tgcatatctg tctgcttaat 141120 
atgaatctat tacacagaat aaggaaatac atttagggct gactaatgta aaaatactct 141180 
ttgatgagca gtacactgca acatagctca gagttttcaa atttattata ttggctcttc 141240 
caaatatgtt attcatattt gctgagagtt gcttttataa atttctgctt tggagcatac 141300 
agctcagtgg aaagatatgt gtttatctga attagctggc tctaatccca aatgaatatt 141360 
tgtacaaatc tgtttttaac attgatatta atttattaga caattagaat atggcaaatg 141420 
atgcaaagtt tggaacttgt ttcacatttt tttaatggtt tgtactcact cccctccccc 141480 
atatacgaga ttttaaaagt ctatagggat atcttagtac atatttatgc agattacttt 141540 
tttatttatg gaaggggatt gcttgatcct ccaggtggtt ataaaactgc tatttcattt 141600 
tgacatgata taaattggtt tagagaaaac aaattattct aaagtcttta gaaaccaaaa 141660 
tgatgctgtt ggagcatagt aataacgtaa ataaatctag aagatggaga tttttatatt 141720 
tctagaacta gatgggtttg tcctggatca tgcctctgaa tttagtctct gatttttatc 141780 
caggggtgat ttttgacttg tataaaacca tggaaaccaa gagccatcat aaagcaagaa 141840 
tcggtttaca atctttggaa atgtgtattt acgtaga-cca tgaaataaaa aatattttgt 141900 
tcaagaaaag ttgtttttaa tgctgctctg aacagcagac caaaaataac atttttaaga 141960 
gcaagcaagt cgtttttgtg ttttacttgc ctgccttata aatatggaat gttggaccca 142020 
gaagaatacc tgagtttagg accatacatt cttttaggta aaattttatt gaaaattaaa 142080 
gtatatatag aaaagtgcac aaatcagaag cctacggctc aatggatttc aagtgaatat 142140 
agccatgtaa ctagcatgcc aataaaataa cacattgata gagcattacc cgacccgggc 142200 
cttctcatgc ctcatcccaa aggtaattgc tattctgcct tttaattatt acatattagt 142260 
ttggcctatt ctcttgaaca caactcttca atgcttgtaa gttccatttt agaagatctt 142320 
gcaagtggaa tttgccttct cccactagtc agtctttaac caggagatat caagatggaa 142380 
tcagaggaaa tctaagcact ttaagtggtt ttgccgtgac agcatccttt gtgattactg 142440 
aaagaaagct ggaagttgat aaagtgatcc atgttgaaag tacagattgt gctttgatag 142500 
tggccatgtt gttcctgcca caatgtttcc ctgcactatt tttgcccaat attggattta 142560 
acagtaaaag tttaagtaat gccatattcc actaacatat aacaaaaaaa tggaattaga 142620 
aataggtagc aaattttaaa gatcagagaa aaaggaattg tgacatttaa gaattacatg 142680 
tggttagaga tctttatata tcaacatatt gaaagaaatt aattcagtca gttcaggcat 142740 
gacgtttatt tttaacaatt agtattctat tctctataat catgttccac agtttaaagt 142800 
gcttaatgag ttcttactgg aaggggacta tttcgggaga gtataatggc ctttaatcac 142860 
ttcactcaag ctattagcag gaaagattgt caggaaatat tgttactttt gttataaaat 142920 
caattgaact ttcacttgga tgaaacaaga agtttgtcaa tatggtatat aacctccaag 142980 
gtctttctcc cttttcctta tgctatttcc ccctcgtctt cgccatgtgt tattgattca 143040 
tttgaataat ctatgattat tttccttctt aatagcctga gaagaaaaat agcttctaac 143100 
taagcagcta ccaagtcctt aagctagatg ctatgtttat attatctcaa ttaatcccac 143160 
tcagtgtagt cgatggtatg accctgttac aattgtgcaa aatggagttt caggaagata 143220 
cacaatttgc tcattatcaa actctcataa catccacaga aaataattga aacacaaagc 143280 
catatgactc ttaacctcta ctgaaagatg cttttatata gaatctaaaa tgttatgtta 143340 
gatataatat attggctggt gaagttttat cttttccaaa acagaacaat attttagtgt 143400 
tttctcagtg aaaatagtga acactattca gattcaggtg atgatatctt tgattatcct 143460 
catgagagag tcagtgtagg tagttcagtg ggtctcaaag catagtacct ggacaagcag 143520 
caacagtgac atctgtgagt ttgttacaca cacacattca tgagccctac cttaaatctc 143580 
tgaatatcca agcatgcgat ccaggaaatt ctattttaat aggttctcga gatgattctt 143640 
atatgcatta aggtttggga accactgaca tatttcttaa aagcacgaat tgcggaaaca 143700 
gactgtgtaa gtttatattc atgttctgtg aaatactagc tatgtgacat tgaacaatta 143760 
atttatcttt ctggatctca gtcttcttca tttatgaaac ttatacaata atcatatctg 143820 
catcacgttg ttttcagaat tgttgaacac acatagaaaa gtgtcatgct agggccaggc 143880 
acggtggctc acgcctgtaa ttccagcact ttgggaggcc aaggcgggtg catcacctga 143940 
ggtcaggagc tctagagcag cctggccaac acggtgaaac tccatctcta gtaaaaatac 144000 
aaaaaattag cctggcattg tggcagcagc ctgtaatccc agctactcgg gaggctgagg 144060 
cgggagaatc gcttgaacct aggaagcaga ggttgcagtg agctgagatc gtgccactgc 144120 
actccagcca ggaaacaaga gtgaaactcc atctcaaaaa aaaaaaaaaa aaaaagaaag 144180 
aaaagaaaaa gaaaagtggc atgctcatag caagtactat gttagtgata gtaaatgcat 144240 
ggatcttgtt ttcataatac aaattaatta atgatataaa gcattcatta aatatctcaa 144300 
tatttggaac cttttctttt aataataaag tctagtaaat gttgttattg aatagaaata 144360 
cacttttaaa aacgttgtgc attattttaa aaatatatga atcagaaaag gaaaggtgat 144420 
ggaaaactca tacaaatata gtaaattgca ctgtttttgt gaagatgatt ttatgcagaa 144480 
ttcaaagtag atgcaattag tgcttcaaat tagaaactgc aaatgatgaa ttcacaagca 144540 
catgatttaa caaacaaatc agagaagaga aaaagtttct gatgaatttg agtttctaca 144600 
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tgtggatttc atcttgtgat aagccattct caggagaaag catgagatgt tattcgtcat 144660 
tccagaaagt gggagaacat aaaagttgag aacagttatt gcataagcaa gaattatcaa 144720 
tgaacacaat taatttaata catacacagg catttgactg tatgagaaaa agcatttcat 144780 
gttatttgct ttgtagatac aatatcagtt tcgatgagat tcatgctgca cggatggtta 144840 
cacctgtcac ctgtgggtgg gtaatcttaa tacctgtgat gacaagtaga ccacggggga 144900 
ggactcgcaa aggatttgag tgaggagcca ctgatgctgt gtaaatgcca gagccgtgac 144960 
cgtgcacttt tggggggatg tgcttaaata tcttcagatt tcacagaagt tagggcatca 145020 
acggaaaata cttttgtttt ttctagctca tctgcacaag ttgaacacac attgttagat 145080 
agtgtgtatg tcagagcata tgtgaacaaa tgaaggcgaa aagtcaatag ccatacattt 145140 
gtatttgtat tgctctttat caatctaaaa aggtacatta aacaataaaa tgacaattca 145200 
atttttaaca gcattctaga ctctagaagg tgttttctaa tccactctct taagccattc 145260 
tcacaacagt acaagggaga atttcaatga gaccaggtca cccagcaata tggtgacaca 145320 
gatgagacac ctcctcagat ttttgtttca aaatttttgt tctgcctact atgccataca 145380 
tgctgtaatc atttaattag aatttatatt ttaataaatg aaaactaaag agtttttgca 145440 
attgtgttaa cactcacgtg ttaaatttgt aattatgtcg gtaaataaaa agagtttaaa 145500 
tccacacccc attagccagg cgtggtggtg cacatctgtc tgtaatcgaa gttttagaaa 145560 
actgaagtgg gaggatcact gagcccagaa gaggtggaaa ctgaagtgag ctatgattgc 145620 
atcactgcac tctagcctgg gagacagagc acgacccttt gtccaaaaaa agaggaaaaa 145680 
aaaaaaaaag aaaacaaaaa ggcgcaacac agagagaagt gatcttatcc ttatccttat 145740 
gctagatata tatttttttt ctggtcagag aattacaagt ctgtctaggt aggctcacat 145800 
ttgagaagag aaaatcctgt tctattgact tagagagtta aagaccctgc cataagcaga 145860 
gagggcacac gtgggactct gcgccaggac aggcacaact gtgggggacg cttcatagca 145920 
gcttttgcag atctcctctg cacctgaagc gccgcctgta gaagctagat gaaacctcac 145980 
aatgccaaag ggttgctggg cttaggaagc acgggctgca tagggataac tagatcttta 14 6040 
tggatcctga agccactgcc ctttgtctgg atatctgtct ctttgtgtgg ccttataggc 146100 
tgcatttttc ttgcctgtat gcctgaggtc ttgatttagt cacactggca gggtacaaac 146160 
tggcctgact ctgccagtga cttcacagtg cagttgcaat gtcatttagt taccttccct 146220 
tggaggaact gctactgttg atgaagagtg ctttggacca aatattttat gggcccaaag 14 6280 
ttcatgtatt aaagccctaa ttcctaatgt gatgatgtga ggaagtggga cctctgggag 14 6340 
gtaaataagg tcatgagggt agagccctta tgaatgagac tagtgtccat gtaagaagag 146400 
atggaagata tgatctttct ctccaccatt tgaggacaca gcaagaaggc agccttctgc 146460 
aagtcaggaa gagagccttc accaagaacc ctctctgtca gcactttgaa ctcagactcc 146520 
ccagtctcca gagttatgaa aaataaattc ctgcatttta ggccacccag tctaagacag 146580 
agaggtaaaa tatatagtgc ttgggccact cgtttgttat taaattcact atgtattcag 146640 
cttcacttat atccttagaa aagtttgtga atgcagttga atgggtggag aacggtagtt 146700 
gcagataatc agagaagaag agtagagaca gggtgcaggg gaaagtaggt cctttaaatt 146760 
gaaaaatgat aacatgatcc attcatataa atgcactttt gttcaattct gctccagctg 146820 
tattatccag cttctctaaa gaatggtgcc tgtacaggac agggtgatct ttaccctgat 14 6880 
gaaagatggc tctccaggca gatagcagga gagacattcc taattgattc caggagggaa 146940 
tgccaccacc atcaccacta tcatttatgt tgatatcaaa actgaaatgt caatgatcat 147000 
ttctgttctc taacatttcc aatcagataa tttaaacatt ttactctaag agtttttcaa 147060 
ctcaaatgtt atgcttcctg ttttgtgcat ttgtattatt aactctagtt tcatgggagt 147120 
cccagttttt atacatcact gtatagcttt gatgttaaca tgattttgag ttaacataat 147180 
ttttgccaaa gtgagtatta acctagtttg atcacacaat ttggttattc tttattctga 147240 
tgctaaggga atccaatgat ttttgttgat agtaacccat caattctatt gactgtaaag 147300 
agtaatctag aagaaattta aagaaaattt aatcagattt tacgttttct gtcgctaatt 147360 
tggcatctta atagctgttt attttattcg aggtgcatcc tgagtcttgt aatgttcaca 147420 
ttgcaatatt aaagcataac agatgagcct aaaatgttca gtggttaatt tgtacagatc 147480 
aagatgttgt aattgcaata accttaatgt ggctaaccct tgaataacat aggtggaaac 147540 
tatgtggata aatccactta tctgcagatt tgctgccacc tcggttaccc cagacacaga 147600 
aaaaccaaac cctcctcttc ttcccccttc tcagcctact cagtatgaag acgatgagga 147660 
tgaaaacctt tatgatgatc cacttccact tcatgaatag taaatatatt ttctcttccc 147720 
tatggatttc ttaatagcat tttcttttct gcagctttat tgtaagaata cagaatataa 147780 
atcatacaac atacaaaata tgtgttaact tactgcttgt tatctgtgag gttttcattc 147840 
aacagtaggt tgtcagtagt caagcttttg aggagtctaa gttacataaa aattgctgac 147900 
tgcgtggctg gggctggggg agttggcagc cataacctcc acattgttca atggtcaact 147960 
gtataatttt ttagcggaag tgtaggaatg tttagtatga tacaaccatg gagaaatatt 148020 
ttgtataaat atgtgtaaaa ttaagactgt agaactaaaa ataatacact tagttcatta 148080 
gttattaata gctaatattg atttagctct gtgataataa attgacatgc attacctcat 148140 
ttgtcacatt gtcatgagat caggattttt atttgtcttt catttacaaa tgaggaaact 148200 
aagtctaaga ggagttaagt gatttaccca gtattacatg gcaaaaaaaa agtcagagtt 148260 
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taaacttaaa gccgagtctt tgcaattcta aagtccatgt tcttagctac tgcctttctg 148320 
aacttattag tcaacctgaa ttctgaggat ctcacaaaaa gttacgtgct aattattttg 148380 
gaagcattta tattatgagt ttctaaaatt acaaattctt ttatgtatga tagggaggta 148440 
aaatcatggc atttagaggc aaacatattc tatttagttc ccatttgtgc tacttaacag 148500 
ttgtgaacct gacgagtacc atttagtaac ctcaactgtt aaggtgtgta aacaagttca 148560 
tttcatagag gtactgggga aagttaagcg agagaatata tgactaacaa aaaatatgat 148620 
tttcattttt tccttgtgtg tatacttaag gggaaattta ggactgccta cttgttatct 148680 
tccactgaaa acaaaacaaa acagaagatt aaaaatacaa acatgttgcc cttgtttatt 148740 
ttgtctaact caagttgttc tgatatcatc catttctgtg taaccacaag cacagttggt 148800 
aaagattgag ttagcaatgt tccctcaatt ccagattgcc ttctctacca tctaatttct 148860 
ttcagtggta ccacaattct tccatgctgt aatagaaact ttgaaattat attttgctat 148920 
ttttctcttt tgcatcatta ttttacattt tgttgtcaaa tatttatttt ctgcagaata 148980 
tcttataatc ccatgcttct aatttctagt cagtgacttt cttgattctt attcctctaa 149040 
tgtacaatcc aaataacact ttcttctctt ctttcattta acaagtaata atttaacgtc 149100 
aacactgtgt catatctgtg gtgtcaccta tcagtatgca gaacaccaag ggattcccac 149160 
tgtagtgtga ttcttcttct tgtgtattaa caccaatgct ttcttggcct cttaacgctt 149220 
cctccttgat atttttcagt tccaggcact ctgccacaca ctctcatggc cacacacagg 149280 
accttgttaa cacctggaaa ttctccatct ttgaactatg aaagattagt atctgcaatc 149340 
agtaaatggc atcatctctc atagctttta ttccccaatg ccttctattc agccttgaac 149400 
tttgtttttt ttctgcctgc tgcttcttcc ctcattaagc cagccacatc attcgatccc 149460 
ttcactcaca gcattttcaa gaacatcccc tcttcctctt tttatcttca gtcactctgg 149520 
tgtgaatatc ttaacaatta tacaatacac tttcagtgtt cctagactcc tgttgttaga 149580 
gacagctgtg taacagaatg ccctggtgca gactgggggt ccctggatac cgagggtctc 149640 
cagtttcagc tggcctcctc ccttaggtaa gagatagaaa caagaatctt agtattattc 149700 
taatgttcaa gtatctggtc ttttctgcta tagttttccc ccataaatct tgatttttag 149760 
ccatattaag tgtttttttc ctttaccaga taaacaattt tgtctccaga ctttctgcca 149820 
ttttcacttt ttttacctgc caaacttttt gtgtctttac gtactcaatt taaaatcctc 149880 
ctttatgaga tttcagcttt ttctcttgga ctgagattga gaacctctct ctgggctcca 149940 
ataatttcaa atgtctgctt aaattttgaa gcataatttc attttatgag aatacataga 150000 
tttcagacta ctaagcaata gaaatatgga tacaagacat acacgggacc acttggggca 150060 
tagtaagtat tcaagaatac tgaatacttg gtggatattt ggtgagtgta ttaggccgtt 150120 
ttcacaccgc tataaagaac ttcccgagac tgcgtaattt ataaagaaaa gaggtttagt 150180 
tgactcacag ttctgcatgg ctggggaggc ctcaggaaac ttagaataat ggtgaaaggg 150240 
gaagcaggca tgtcttacat ggcagcaggt gagacagaga gagagagaaa gaaaaaaaac 150300 
agccacttat aaagctatca gatctcgtga taactcactc actgtcacaa gagaagcaag 150360 
gtggaaacca ctcccatgat ccaaacacct cccactgtgc cccttcacca acaggtgaag 150420 
attacaattt ggattacaat ttgagatgag atttgggtgg ggacacaacc aaaccacatc 150480 
agggagtatt ataattggta aagaatgact gtccctagtg cattgaaatg taagtctcat 150540 
ggaactaacc tatatttgtt taatgcaaat ttgaattcaa gatggcagac taaacatatg 150600 
cattttttac ttatctttcc aagtctttgt taagttttaa acggattata taaaagagca 150660 
agaagacaag aattaaaaaa gaagtagtaa gccacatggg aagtatgcat gagagttaga 150720 
agctgaggaa gtgagtagta atctaaaatc tcagagaggt tttgggttca taggtgtgca 150780 
aaaagagaat gagcattggg tccaaaaaac aaggcgaata atggaaggtg aatagttgaa 150840 
cagttgcgct gatcagtcta gctttgcagc tcattgtcaa agaagagatc aagttggaat 150900 
ttttggccaa atactatagg gttgtctgtg atatttaata gtaactgagt gagcagcctg 150960 
agagtcaaga ctgttagcta tgttttgaga ccccaaagta aaccccttgc attttatcag 151020 
ttgacagaag agaagtgctc tacatgtgcc ctgacttaag cttatttttc attcgtaagt 151080 
gttaggtaat ttttagtaat atcattagcc ttctatatgt agagttatgt gagaattaaa 151140 
ggtctctttc cctgcaatct cagacagaat tctactgtac atgcaaacat gtattatcaa 151200 
gtatggacaa aatcaattgt actgagtata atgaagtcca ggttctctgc cttttatgtc 151260 
ctctaattgt acacagcata aaatgttgac tcaagccaga gcacaaaata ctcaaaggac 151320 
agaattcagc ttatcttatg gtcagatatt tgcactaaaa taattgaatc agtgattgga 151380 
tacaaagact tggatgacat cacaaagtgt tcaggctctt gaacttccta ggtccttata 151440 
accttctttt gaaacaatag aggccgggca tggtggctca tgcctgtaat cccagcactt 151500 
tgggaggcca aggcgggtgg atcacgaggt ctagagatag agactatcct ggccaacatg 151560 
gtgaaatctt gtctcttcta aaaatacaaa aattagctgg gcatggtggt gtgtgcctgt 151620 
agtcccagct actcggaggg gtgaggtagg agagtcgctt gaacctggga ggcgggggtt 151680 
gcagtgagtg gagattgtgc cactgcactc caacctggtg acagagcgag actccaaaaa 151740 
aaaaaaagaa aaaaggaaaa ataggagtac atatggaatt ctaggcatct cacaaatctt 151800 
taacgttagt atttcttcaa ttaaaatact ttttaaaaat tctaaccgct agaccactag 151860 
agaaaaatac ttttggtttt attaaatacg ttatttaacc aacctctggt ctcatcagag 151920 
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agaaaaatag atggcatgtt tttacctttc cttgtttcag gcttctgatt agaattccta 151980 
ctagtcaacc atttccaggg tgcacattct taatatttct caaaatttat attaggctga 152040 
aacctacaaa atttattttt ttaagatcaa agaatgtcaa atattagcaa gtcacatggt 152100 
ttatactgtt aaagcaaact aaatatggcc tgagaaagac tttgtacttc tgtatttgag 152160 
accttgtgga tgaaccgcaa cctaaattaa taggcagaga aaattttaaa cttaacttag 152220 
gagttagcac ttataacaat agctgggtct tggtaaatcc cagcagccat acttcagcca 152280 
ctcatacacc gctgagtatt caaactgtgt tcaaataagg caaatgccaa tctgtaacca 152340 
gtccagctgt ttctgtacct cacttctgat tcctgtatgt cacgtcactt tatttgtcta 152400 
taaatttgtt ctgaccacaa ggcatccctg gagtctctct gaatctgctg tgattctggg 152460 
ggctgcccaa ttcatgaatc gttcattgct ccattaaact cctttaaatt agtttggctg 152520 
aaattttcct tttaacaata ccatctctcc tgggagagtc aaactcatat ggcaggtgag 152580 
gattcagaga gattgttttc caaaagatag aaagtaaagt tgccattttc ttaaggtcta 152640 
ggcagagaaa ctggtatagc tgtacttcta ccatattcta tcagttacac actgtcatgg 152700 
agcaggctct tatgcaatgg aagaggacaa aggtggcctc atcttccctc catgctctgt 152760 
tacaaatatc aatctgtctc cctttgcatg ccctgtcttt ccttggattt taggctattt 152820 
catttccttg caatatcagc tctctgttgg attcaagaat agagatgatt ttgtagtatg 152880 
ttccactatt tttcccccaa attattaagg tattagtaat gctctttcat gttccatatc 152940 
ttaagtggaa gtggaacttt cttgctataa cttttgaatt agcaattcta gactctagac 153000 
tccaaaggag tctgtatctg caaaagttct gagggaaaat aattgtgaag ttaaaattct 153060 
aaaaaagaaa aaatccttgt ttaactaaat tatcattcta aaataacaaa aaatgatatt 153120 
ttataattat aagaactaag gaaattgtct ttatataaat gtctggagaa atagatttct 153180 
gagaatatat ttcagcaaaa caaggaaaaa ataacacggg gatatactgg gactgaagta 153240 
acagtgcctc ttaccccaga gataagtgac atgaggtccc agaatcattt taagcaggct 153300 
tagaagccga atgtataaaa tatatcatgt aaaatataac gtgacacaga gggcacttag 153360 
ggtcattctg attcttgacc ttgatctgag tttcaatgac gaaaatagat tggccaaaga 153420 
atgacatact agccacgcct ttattttcat tacatacact ttttactcca tactgcacag 153480 
atatatacaa ttattttatc ttcctaacca ggcaataagt tacttgccca tcttcttatt 153540 
aacaataatg tagataaagt atcttatgaa agtagagaaa tgtagtcagt tatttgataa 153600 
gtgtgtgagt ttgttcatat ggtcttatag gaaataacaa acataattta aattggtaca 153660 
ggttgtacac agaacattta aaaaagttct acctaatctt aaaaccatgg atttttactt 153720 
aatgtttctg cattttataa tcattaaaaa caatggaaat tttgttgagc ttaatatgaa 153780 
ttaaaatttt tatattagga tatcgtaggt tttgccaata gaattctaaa acttaaatat 153840 
atctaatttg attactatgt aggttatgca ctgtgactca ttagaatata tgatgtgttg 153900 
tttaaagtca taatgatcat cagatgaaat atggccatct ttaaagttat ctcatagatc 153960 
taagtagtga ataaacgaga caagagtaat gtaaaaatgg ttgacattat caatgaaggg 154020 
tcaatggata actgcgtggc tccggttctg atactcagaa ggaaaccaca tcacttaagt 154080 
agtaatatga ggaaaatgaa ccacattttt aggaaacatc acacaaactg taaaccaaga 154140 
acactttact attaaacttt ttttcaaata tgccaatgcc atgaaaaaaa acaaagagct 154200 
gaggacccat tccagtttaa aggagactca agagacatta cgaccgaatc ctctatgtga 154260 
tgtaacataa gctatcaaag acattaactt ggtcaattca taaaactgaa atctggaagg 154320 
tagatcaaat aaaggttttc tatcaacatt ctgtttcctg aaacggtagc ttgtggttaa 154380 
caaagaggat gcccttgtag ttagtactaa gggttgtagg gtccgtgtgt cagtgacttg 154440 
cacttggcag tgagtgcgtg ggcgtacgag catgcacacc acacagtgca cacaggggtg 154500 
atgcaacagt cagtacatcc aggaaaagga tttatggcag ctccgagtga gtgcgtgggt 154560 
gtacaagcat gcacaccaca cagtgcacac gagggtgacg taacagtcag tacacccagg 154620 
aaaaggattt aggggagctc tgagtgtgtg gatgtacagg cttgcacaca cacagcacac 154680 
acggagatga tgcaacagtc agtacatccg ggaaaagcac ttatgggagc tctgtgtact 154740 
attgttgcaa tttttctgta agcttgaaat tgtcctcaaa taaaagtgaa aaaatgttac 154800 
cccaggaagg attggcacaa cattatttta gatgaatact tcgaaaactt cctttatttt 154860 
ttgaaagtta ctggaacacg ctgattacac atttgggatt taacccctct gccttaagta 154920 
taaacttaga cgtaagtgta agaacccagt ttaagttatc ttgagaagca caaaatttga 154980 
gttactacac agtttccttt taaatggaaa tgaaatacat ttcaagtcat actaatatat 155040 
gttggttttg aactacttgt agcccttcct tagaaaacta aaacaaaaac tctcttttaa 155100 
tttcatgact ccttcaaata cggaaactat gaagatttat gtcttatgaa gtttatttta 155160 
cacataagtt attagattct tttaaaatta ttattattat tatttatttt tagagacaaa 155220 
gtctcactct gtaaccaaga ctggagtgct gtggcaaaat catatctcac tgcagcttca 155280 
aattcctggg ctcaagccac cttcttgcct cagcctcctg agttgctggg attacaggca 155340 
tgagctacca tgctgagcat acaaattatt tggtctgatg aaataattat tgataaaatc 155400 
aacagaagtg atcattataa gttaaatatt tacagtgtga tttagagaac atacctgtag 155460 
aaaggattga aacaataaaa tattattggc ttggaatttg tatagaattt tgcccttcaa 155520 
tatagactgt catctaaaca ctcaaataga agacattcaa atttacaaaa tggtgcttta 155580 
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ttggccagcc tagataattt ttcttgcttc ctcagccccc tttttactta ctcctgaggg 155640 
cttaatatca cagacaactg aaattcatct gcactctact tactatgtgt aaccagctgt 155700 
gattttattt tctgatgcta ccacatgaaa aaaaaatttt tttcctgagc tatagatgtt 155760 
aattggtgtc tttgtatctt gctaattaag ttctagcaac accttctccc tcttcctcgt 155820 
ttgcacttaa tgagctcact tgttgtatta atttataaga taattagata aaacaaaaaa 155880 
agttgaattt cttcaggagt ggctttggaa agtgcccctt aaatattcaa acatggataa 155940 
aaatgtgtac attacaagac aaaaaaatca ctttaagaag aaatatactt aatgttagcc 156000 
tgtagactca tactgacatt ttaaataatg tgatgatcag ggcttataca aaaaggaaga 156060 
gattgattca aaagctcagg caatcacaat tattttggtg taagcaaaac aggatcacat 156120 
atggtacaac atgatgctca taccagaaac ctttgcaata tttaagcaca acaaacaaat 156180 
tagttaagca atgtgcccct tggactctaa ttaccatttt tatttgcatg ttcatttttt 156240 
tctaaaattt ttttaagcca gtggccaggc agtattaaat tctttgctgt ttatcatgat 156300 
accagaattt atttgacaaa aagtggaaaa atcattgacg tacagtgaaa ctgatcattg 156360 
tttaatagta gaattctgtc gttcatcagc agttagggaa ttgtggaaat tgttttgagt 156420 
gagaacatcc agatgttatt gattccaatt ttctggtcca actcaaactc atttgggcca 156480 
ttagctttcc aacttgtcca atatcatcag aaaaggcaat ggcatctctt actgctctat 156540 
ttttgtcttt ggttgagtaa tggattccac ttgactggca tatatattct ggctgccctt 156600 
taggaaatct aagcatttga tgtcaaaaat acacatcttt taaaatactt cttaatattg 156660 
aggtaaagca aacattcaaa attcaaaata tcaagctgta aatgttaata ctaaacagga 156720 
acatatccaa tcgacattga aattgtatat aatgaacaag aaacaataaa ttaaagtgga 156780 
aagtcataat aatatgcaat atttcaggta ttctttgata gatgtaatga cagaagctac 156840 
taccttgcct gccccagtgt tagtaacatt gagctttcca gctaggatta gagcattttt 156900 
atatgaatcc ttaaaaaatg caataactac tcctccagga gaggatgctg tccctgcttc 156960 
ctgagggata gcactaatat ttccaggttg tctgctttac atcaagactt tgctttagga 157020 
tggaaacagg attgtaggga gactgtgagc tcaacagtaa gattagtttg tctccatttg 157080 
atcttttgat gaaatgcatt ggggtaaaac ctcttcccag tgtcattagt agctcatgaa 157140 
ttagaaaaga taatttttct cctacatatt caagaagaat cagaaccagg taattgacat 157200 
agccaactaa acctctatag ttattggttt tgttctgact caaatgctaa cttgaatccc 157260 
ttatgactgt taagagcata aaggggaaga gaatgcccag ggaggcaact ggagagacag 157320 
agacagcaga gctgtgagaa gactgtttgc agaaaatatt cttcccatgt gtggacttgg 157380 
cctttgaagc agagaattac aacaggaaat . taggaatagg ctggtgtagc ctcataaaca 157440 
agtacaaaat tgtgtcagca caccaaaata tttcactaga ctgcccttcc cctaggaggt 157500 
tcacaggtgg tataataaat gaaatcagca atccttctac atgtctcgct tatatggtta 157560 
ttttggctca gccgaccatt agttatttgt aagctggggc ataaatacag acactagttg 157620 
aatatttatg tacagctcca gaatccacct tcactaattt gtctacttat ggggaatata 157680 
gctgtggggg gcatggaaaa tatttactct gggagccctt aaatgtaatg catgaatatg 157740 
aatgtgctct tggcacggga ggtcctggaa ctcttggtag gtgttgggtc cctactgacg 157800 
ctttatcatt ttcttttttc cttcaactga atataaaact gaaaagccgc ttttaacagt 157860 
tagtaagtgg tgatactgcc agactcatcc tataagaagt tagctaaatt ttatccagac 157920 
ataatatttt ttattagctt acttgtagag tccctcctca gagcccaact aaacgtggtg 157980 
ctaaacattc ttttgccttt gtagaggaat aacattctga gactcctttg ttttttcacc 158040 
aaacagtaaa attaaccagg tattgaattc atactctcat taaagtcatt gttaccattc 158100 
ttttacaaaa aaaaatactc aaggagtaaa gaaacataaa aattgacact aaaacaatat 158160 
cacctacgtt catctttctc tggatgttag atttggaatg tatgcataga aaatggtctt 158220 
agtgtcaaag atttaatgct gcaagagaaa tcatctccaa aagaatatct cctttaattc 158280 
tgtcttctta tttatcaaat aatatctagc gaccgcactt ttcttttgtt ccttttgtat 158340 
aagaaggaat tatagcggtt tgccacatta gagtacttta aaagtgcttc ctaaaatttg 158400 
gtagagaatc ttcttaaaat tttactccac cctgcatctt tctggatatc tatcactcct 1584 60 
gagatggttt ttccaggctt agattagttt tctaaatctc tcagctttct tttatctcaa 158520 
gtgacacagc agttaaatta catcatgagt ttagcctgga ctctcttgag tcccccaaga 158580 
aggaaaactg aaattctgca aaatgaccta tatttaaggg tgaaggaatt aagagacatc 158640 
tcaagccgtc cccttattag aaggtaaact aggaatcata aaataatata tttgaccgag 158700 
tgctcagaaa tgattaaatc caacaggctt gccaacagag ttgtaccgta gaattgccaa 158760 
gggagttatc ataaagcaca gatactgggc tctgcctttg accttctggg tagataaatg 158820 
ccagtgattt tgaaagtaat tccctgtgat tctactgaat ctccacaaat cttgtgtaac 158880 
ctctcaattt taaaaagaag tttttttttt ttttcagttt aagccattta ttcaaagtca 158940 
aacatgcttt agtgatggag aagaatctgc ttttaaatgt attaaaacaa attgatagga 159000 
atcatgtttc ttattgtttt cctgtataaa gttagacaaa taggatattt ttgaagttat 159060 
aaatcattta aataccctta aactgtttac aatatcgttt gttatatttt tatcttctgg 159120 
ggtaatgtta tttaaattaa acgaggtact gagtgcttgc tgaaactaac tagaaatctt 159180 
aggccttaaa aatagctatc tcatattttt aaaagaaaca tcaataattt ttatatattt 159240 
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ttaaatttat gagatggaaa gttagctttt actaatctta tcaggctaaa ctaaataata I5<»nn 

atgaagaaaa tgaagagttt agaaatattt tttggaatat cag?gaa"c SSaqa lllllo 

aaaaaaaaac cctcatctat atgtgtgagc gacataaagt cacact^gt tgaaStacaJ lllllo 

aatggccttg gaaaatttca tggaatgaaa ctaaatggca gcacatcS ^"caqata lllllo 

gggtcttgtg gttgtggctc agatcatggg ggtgatggca Lctcttcca aagg^a'L? 595 40 

tl c?taa a aa C ZllZT^a !f a * ta «" * CatagaCtc acagatgagt aat'atgtcag 1596 0 
tttcttaaaa gaaagagttg ttaggaatga gagtaaggat gtgcctcgat tacaataccc 159660 
cagatacgat actattctga aaatggtgtg ggggttttgt gagggatgcc aaccacacat 1597 2 S 
tgtgctagaa atctgcatcg ggaagaaata ttttttgcta catgttacaa atatgcctat 159780 
cTztllTtl tCaaata f at "tgtgttt. ggccaggcgt ggtggctcac gcctgtaatc 598 J 
tttrr.lt Wgaggctaa gccaggtgga taacctgagg tcaggagttc gagaccagct 159900 
tggccaacat gatgaaatgc catctctact aaaaatacaa aattagctgg gtgtggcgac 159960 
gggcacctgc aatctcagat actggggaag cttgaagcag gagaatcgc? ggaacctggc llolto 
aagtggagat tgcagtgatc tgagatagca ccactacact ccagcctggg ?gacagga?g 16008^ 
agactccatc tcaaaaataa ataaataaat aaaaataatg tttttataa? t?tttcatag 1601^ 
aaaaataaac ctgaagttag gcagtgcaga cttgaatttc tcattctcaa ttcccattca 160200 
ttttcctgtg tttttgaggg tttctcctgg aaagtaaaac aattcaatgc tttltc"" iSS 
gcccttcaga agtcgggaca tcatcacatc attttacctt taagatat? g gactgga"tc S 
atagaggaat gagccatata caacatacat aggagtatga agaaagggag acagc^tca 160380 
ttlltltttt 3gtaaCtaat attgt-gttt aatgtaatga gaaataaaac tagSgatga 604 4 S 
ttgatagaat gcagagggca ctttaaatca gagagacgat ttgaaagaaa gaaatatt?t 160500 
tatatacctt gatttattag agatagatta aaaagttcgt ttaaaaaacg ?gatcaagaa lllllo 

ItclZlTt tllltttt? " tttCaaat ^catgacga taatgctgca' ta'atcacc" 160 20 
ctcaaaaatt tcacggcttm gaacaacaaa tattactctg gagctctctg ttctqtqqtt lfiOfifln 

cSSE'J tggCt * CtCt tgaCtt ^ ct WctgwaJ ga?ggatctg cttcaggctg 6074? 
caagctggct gagcttggct catcacgttt cattacgagg cacagactga agtgtttgqa 160800 
c a t a aa C cta a L 9 "° tWcat 9 ttctt taaag gcactgagag gagc?tatga at'gaatgaag itollo 
caaccaS occataat tg % tgaagtC aaaat 9^ag gactgggaag tacatgccac 160920 
aJcatJcaa 3 ctttc™ agCt f cgat ^actacggg ggagtggagt gctggtacga 160980 
a ^!" Cagt f " cacaaa acactggcaa aatgcaggtt attcagacag aatcgttaaa 161040 
tacaaaaacc tacttaaaac ttaagggtgt attagctatc tgtctccaaa tgtacttttt 161100 
rttttttT, tataagg ! Ct ttCaCt ^ C ^ aa.gt.tgc. ataatgaaga agataJttgc }S»S 
llltlrtT g9gaaagtgt ttttcaggta gaagtgaatg cagaaacaac Jttccaagc 161220 
«"a«S? ™™£" 9 " taggCtggaa -^^attct cttaatagaa aagaaagagc 161280 
acttggaatt gagatagaga cgctaaactg agctactcct gttggggctt tgctgatcct 161340 
tttalttt I Ca " Cattaa "atcacaga agccacctat gt.ctcJgc. t? Ca ?tgggI 6 4M 
tta? ct ?caa ^J 9g , ttCt taaCttCtCt a tgtttagca ttttgctt.g aagttagcac 161460 
ttatcttcag gtattataag gcttttaaaa tgacattttc ccctctagag gaagtatgcc 161520 
tttgtttgtt tatttgtttg ttttgtcttg tttcttagag acagggtctt gctctgtjgc 161580 
tZTrr^* f gaagtg9C ""tcatag ctcactgtag ccttgaactc ctggactS lllllo 
tttlltttlt tr t t ™ 9tC V CCCaagtag -tggggctac atgcttggct ..Stttt.t 161700 
' fct " taga = at ^ggtctg gctatgttgc caaggctgat ctcaaactcc 161760 
tggcttcaaa ctcctggact caagtaatcc tcccacctga aggtgggagg ccaqcacctt 161B?n 
St^tS tllT'Tr gagccgctgg tL^ac^ tgSgaSg 0 

atltltcttt ITtJt I C3Catataat acatacttag gttttctcga tgatgaLtt 161940 
tl^tfff tactttta 9 fc aaaccgtatg tgctttgaat taataaaaaa gttgtgattc 162000 
llStSttt UtTTt tCCCttCtct tttggaggtg ctccgttaaa agattccctt 62060^ 
ctTttllla TrTr^ 9 9 aatt 9 a "c aaaagcattt ccaggcatta ttcttgttcc 162120 
cggttggctg acatagagcc ccactcactg atggcactgg ggaaggacag agcctgttgg 162180 
ctg aa ga C c a t TatZllllt ^acaaaagg atcctaagac g?tt.g.c2 I622I0 

tatalllTtt »rttn tggtaagaaa tttagataaa tatttccttt 162300 

tgtaaaaatg acatgaattg taagtttcct acactgtaaa ttagaaaatt taaattatat 162360 
aaaactatat aaaattataa tttttttctt aatattggca tttttatgtg tactaagg^a 162420 
ca= a tttt a = ^T^T tcgaattcta ttgggagtgt attattt?ca actcataK 162 480 
caccttttaa acatttgatt cttataactt ttccttctca ggaaaactga acctatattt 162540 
S" ? ; ?taa 9 a C L CC : 9 Cagctggttc aaaaa atgat tccgtagta'a taatcacatc 162^2 
latSSS?? c^ aa ^ 9 , 9 aagcct ^ c^cgaaaag ttaggtcatt taatttgttc 162660 
aatgcagttt ctaaagtttt caagaagtgt attagattta aaagagactt gaaagggaat 162720 
ctgatacaat gtcattacaa ctgagaattt tctcaccaga attcttgaaa gtactgaagg 162780 
ItlttltT agtgaCa " C t^ttacaca tgtagatcag aaaccaagaa gtcta?gag? 162840 
aacgtttgga aagaacaatg tttaaatatg aaaaatgatg tgactcatga ttgaaaaaga 162900 
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aagtcatgta aattggaagt gcaaaaactt tacaggagaa atctatgaga ccaaaaatag 162960 
gtagtaaaat aaaaatgaaa gtaaatatgg ttgcctaaaa cacattaaca aatgaaacaa 163020 
atagataaaa ccaatgagaa tgggcagcac taagagtaac agagaagttt gacccaggca 163080 
attttgatga attacttggg aaatctggta ataatcaggt aatatgggag cagcatgatt 163140 
agttaataat tccactttaa aaaattcaca cagaaaagga cagagtatat atggaagaca 163200 
tgcacagctt ctatattaat gtgacactgc tctgagtaat taaacaccgt ttccaggaaa 163260 
gatttgttca gttgttccta gagggtgtgg atacatccag gacattcagc ttcatgactt 163320 
ctcatttgtt ttagttttta tcttttaaat tgggggaaaa tgatgtgtta ttcactgatc 163380 
ttttcttctg agatgccaat ctaaatctgt tatcttatta aacaataagc agattttgtt 163440 
ctgatttagg caattaccat ggttagtaaa caagaaaatt accctaacaa tggaattagg 163500 
ttttgtggac atgatcaggt catctttttg aggtatctgc cttttcctgc attcataata 163560 
accactatta ctctgattac ttttatactg tccccataag tagtggagtt ctcacttacg 163620 
caatgaaaac cagaattcag aggaaatgta accttctgaa ttcttattgt gcaaatttcc 163680 
tcttcattca gtgccatgct cacctggaag aaataatttg tcaaatattc ttgccaacca 163740 
aacagcagtc atattccttc agagaatttt cacactaaaa acaactatta gttttacgct 163800 
gacagcacaa gtcaatccta atgtctttgg atcatgaatt tatctacaga tttcagaaga 163860 
aacaagacaa acatttttgc caatgtcatc catcaatcaa gtggcaaagc agatgtgtgg 163920 
atttcaatgc agatagtttc tgtccagctg tctgctctcc atcttggatg ttggttttgt 163980 
tgtagggctg gctctcctca taaacttgag atggtcgcta gcaccttgta agggtatgca 164040 
gttcttcctt ggcatccact gaaaaaagag tggatatctt tttcttagga aattcatagt 164100 
tgggtattca gtcgcatgtg cattcttaaa caatcactgt tgccaataga aaacaatata 164160 
ttaactggct cacctaggat cctatacttg tagacagttt cagccaaata caaagatctt 164220 
aagaaaggga ggattggttc ctctacctca agaaaaatta aatcaatgtt gatgcaagag 164280 
tgggaatgca tgttgagagg caaacagtaa acacaacaaa atgataaatc agatctgctg 164340 
gccctcaata ttttcttttc tttgccatat cagaaaatct tctatgattc agagctcact 164400 
cattaaatgc attgcccttg gataaaagcc attggtaaaa taaccaacaa gtcatcctcc 164460 
tttgaacctt tagccatatt catgtctacc tacattatgt atatttcttt gcttcccaca 164520 
ttccatcaga gtaaaatact tcgcaagagc cagccatgtt ctaggacata tggacatgaa 164580 
cagatttata ggatgtggtt ttctcatccc aactcttgaa ggatgaatgg ggatcatcca 164640 
tgcagatgat ggagaaagtc ttcaaagaga agtgaagatc aattctaaca agacgataac 164700 
taaaacagca tagatgttca gagaactgca agaagcctca ttgcttaacc caaaagnatg 164760 
agagagaaga ggcaatgatt ggtagaaatt agatcttcaa ggatcttgta ttagcacaga 164820 
aaaagctgaa gatgtatttt atcacgaata gagtaccaac aaaataattt tcatagatga 164880 
gcagcatttt taccttattt atatgagaat tccttctggc cagagtgtgg catatcactt 164940 
gggaacagga agaatatatt attgccttta gtgaggcaat gctgaggcta ttatattatt 165000 
gcctttagtg aggcaatgct gaggctatta ccaaagtcca ggctagaaat gaaacccaaa 165060 
tgaagaaggc agaagtaggt ttcttgaggt gaaaacagag tcaggtcctc cttaatacat 165120 
aggctcttac gactgggaga taattaaatt tagtggatgg ggaaatgaga agtgggggag 165180 
catgacctgg ttttctgact tagtgtcata gtgtgtggaa atgcattaag agaaacagaa 165240 
atcgtacctg aaggaaaatg cttgtagagg aaaagagtca ttttaaatat gataaatttg 165300 
atgtgtacga gacatttcgg tgggcaggtc taacaggtaa cttacaatac atgtgtgcag 165360 
agccctgtag aagaatctag gcattcagtg tctgctccag ggtcatcagt atgcacgtgg 165420 
cagtcaaaac aataagagtg gctgagatca tccagggaga gtgttttgaa tgagaagtaa 165480 
gcaaatagaa gacatttgga tgctgcccaa tatttaatga atgaattaaa gatacagagc 165540 
ctgtcaagga gctgaagaat gaaaggtcag agaaatagga gcatatttga ggagaagaga 165600 
gtgatcagcg gtaaaattga atgaacagac accaagaaag ccaaagccag ccatgtagaa 165660 
gagcttcaaa ttagaaattt tttaagaggc attcttaaat gaagactgtc ctttattttg 165720 
ttttgaaaga tgatttattg attctttaat cactcaatat tgcaactggg atttcttcag 165780 
aggtctccaa gtctcctcta ttttcacaga tcaaaattct caggtgcaga gtatgtcagg 165840 
acctattcaa ggttacacat ttggctgaca actgtgtcaa gactaaaatt tcatgatcgt 165900 
tatttcagtt catctgaaat aaatatcctc caaacataaa ccataaactt tatgttttaa 165960 
gagcacatac acctctttac atgaagtcct ccagctttgt aactcctcca ttcttatcag 166020 
gtatcacttc ttcagacatc cctctcttca gtcattcctc tcttttcttc tcagcctgta 166080 
tatcataaat acctatcatt tctctcttgt tgtataattg actccatcat cttttgttct 166140 
tcatttctac ctggccctaa aacctacaca tcagatcaat cctatgttct ctctttttag 166200 
gctccatcct tgtattgaac acaatgctgt aattatgtga attggtctaa tttgaattct 166260 
tgccatctaa atccttcctg agcccacagt gctgtagaga aaatatgttt ctattctggg 166320 
tcagcatttg acgtgctttt tctgtagggt tatacattcc ctgattatta aaagtttatt 166380 
tccaagctct cctttcctgt gtattcagaa tatacctcac atgctgcttc acagagacag 166440 
tcaatactat cagctagtag ggcatacaac tgtggccctg tcatgcctaa atcatatcaa 166500 
tttacatcct cccttctcct tccactttat tccccttttc tttgtacaat gtcaatctgt 166560 
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ttcaaccttg gctactgctt cagacatctt gcattgtcag tcattatcct tctcttgtgg 166620 
atctccaaca tcttacctct gccaaatctt ttcccttcat cagtaaccat aatcagactg 166680 
tatttttaga tgatgtaaat ttctataatt ttatgttagc tctggcttat ccctctttct 166740 
tcctactcct cctcttatag tcgtgtattt aggtcacaag tacactctta aacaataact 166800 
gttgccagca gaaaacaata tattaacagg caattatcaa gtatgaactc cgaatttcca 166860 
acagaatatc cttgatactc tgtgtatgtg tgcttgtgtg tgcgtgcgtc tctctctgtg 166920 
tgtgtgtgtg tgtgtgtgtg tgtgtgaaga caagtttatg acaagaagga gagactcaca 166980 
gaaggtcatc aaaagttaga aatttaaatg agaatacgcg tgaaagtaat tttctagcct 167040 
gcacctgcca aagtgttttc tcatagttca tgtgaagaaa taccaatagg ttgttgtcgt 167100 
tagtccacta cacagtgttc acctttaatg taggtggcat gggcctaaac aaggatggaa 167160 
actgatatta acagagtgtt atacagcagg tgctgtacta agggctttat gaggaagggt 167220 
taagttagtc atataataat atacttaaag ctttaatttc atggtgggat agttcactct 167280 
catgccatca taattgttgt acaatcctga attaaagaac ttttcaagat gttattaatg 167340 
aaaaatcttt actgagtgaa ttctatctga ccctcccagt caaaaggctt ctactcaaat 167400 
ttggctaaac cctattagtg atgtgggttt ctgactgctg ccgtaacatt tatttggaga 167460 
taagatgtgc tatatattac agatggactt tgggaagtga ctgagggtga cagaaattaa 167520 
ttaaccacaa gcctccaaat gaaattttta tgctggtgag taagaaatac aaatttctga 167580 
aacacagcgg tatgaaataa tgcagtatca atggttaagg aagcttccta ggttttgaag 167640 
ttctatatta aaagatagat atgtaacaaa cctgcacgtt gtgcacatgt accctaaaac 167700 
ttaaagtata acaataataa aactaaaaaa aaaaacaaaa aaactttgat aagagagtac 167760 
atcttattaa tgatttcaca atgatgccta atataattta gtgtcttggt aatttaggta 167820 
atgaataaat atgggatttc ttttcaaaaa atataagtaa tttttaatga aaataatatg 167880 
cagaaataaa tacggggtaa agtttaataa ttaacaaata ttccagaagg ccagtagtct 167940 
gcgttggtca gaatgaaact ttcacagagg ggagagtaaa agtgaagcag ataaagtaat 168000 
gaaataattg atatcagtgt ctctaaagta agaccgagga atttgaattt tgtaggcagt 168060 
gaggagcctc tgaaatcctg agaatggatt caacataagc aaaatcgtat tttggaagca 168120 
gccagctggc atgcatgggt ggttggcgag aagtctccag tgagcaggtc attgcaggtg 168180 
tttgtctcat gtactagttt aatgtgccgg caggggggtt gctgtgaggg aatttaagaa 168240 
gcctgaatca gcagttctta gcaactacat gaggggatta agacatgtaa aagaaagtca 168300 
actgtgactc caagatttcc ctgaataagt tatatcttat tccaacgcta ttttgaatat 168360 
ttccagggtg tatattctga atatattctt gatatattca agattcaacc tgagtaagtt 168420 
acatagacct gatgacattc agtattcgag gtcagataac aatgtaactg aattctcctg 168480 
taaagagcaa aaatatgttt taattatttt acaagtactc aacatattaa ggaacttatc 168540 
tagagtttcc tggaaactca agtcacggct gttgttgatt ttattaatta ggaagtttac 168600 
atgaaccaat atagaactac actgctttat atgtataata ggtcctattt ttgaaatggc 168660 
tgatacagct ttttaagtgg tttttaaatt gagacaactt gacatctagc ttttaatatt 168720 
taaaggaatg aactaggata gctattcagg tcatttgaat attttgtgca aattctcaca 168780 
ctcattatgt catgtctaca ttcttgctgt ttcatataac tcaaccctcc ttttgaatat 168840 
catgatttaa gcactcataa aatattacta aagaagcctt gaggctctaa gtaacgcaat 168900 
aattagccct cctctaatag cccaatagaa tacataaata attttctccc actatttcct 168960 
ccccagatat tgaracctca atccgagttt taaaaactga cctatactag gtactagaga 169020 
tctctattgg agatgctaat agattgtaaa ctgcatctag aaaacattaa accagtattc 169080 
taaatggttt gagagtctta cttttattct gggtcagcca actttagatt cttttgtgaa 169140 
tgcatctttc gtctgcactt ctgctctcca aataatcccc tgtaacagtt atttttttta 169200 
atctttcaca cttttattct acatctaaaa ccttgacagg atttgttaat ttatcttaaa 169260 
aattctcatt tgtaatattt atcaaaataa attagtttcg agtaactata tcttgatgtt 169320 
ttttaataca ggtttctaac atgagactga aatctaaaat attatcatca tccaaaataa 169380 
tctgaaaaac tgaaatacaa tatatacgca catataatat agttacagtg aataaaatct 169440 
agtatttgat tgcgtaacag gtgacaacag tcgacagtaa tatattgtac attttaatag 169500 
aactgaaaga gtataattgg attgtttgta atacaaagca aaaataaatg tttgtgatgt 169560 
ttgtgatgat ggatatatgc ccccatacaa agtgctgtgc ctcatgcgtg ttatcccagc 169620 
attttcacag gcagaggacg gtggatcact tgaggtcagg agctcgagac cagtttggcc 169680 
aacatggtga aacccgtctg tactaaaaat acaaaaatta gcagggcatg gtggcaggca 169740 
cctgtaatcc cagctgttcg ggaggctgag gcaggagaat cacttgaacc caggaggtgg 169800 
agattgcagt aggtggatat tgagccactg cactccagcc tgggcgacag agcgggactc 169860 
caccttaaaa atatatgtat acaaatatat atacatatat attatacaca catataaatg 169920 
tatgtatgta tttatacata taaatacata tatttacaca cacagagggt cattatgtaa 169980 
caacatgtct tcaaactgaa taattttagc catcttgtca tataaagcct ctaacaccac 170040 
agatttaaaa tttcttcctg ggaagtaaat attaattaag gaaatcccaa cttcttttac 170100 
aaatcaaaac ccatgttctc cataaattaa catcataaaa cattttttgt tcatgtcaag 170160 
cggtcaggtg tctttgggga gtttagaatt acctgatacc tctgttcttt tatatcaact 170220 
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ccataaggaa atcccactgc attacttggt atgtgtttct gtaggcctct ttagtgtaaa 170280 
atttaaaaga agggggaagt ttaagtaata tcaggattaa tgaaataatt ttcaagtaaa 170340 
a ^ggtttac acaaagtatc ttcatctgtc agtgactctt aacagcagat 170400 
5f g i a !"^ " ctc <; cat 3 aaaagttctc cactggtcca ctgactgtga tctttgtgtc 1704 60 
ttttaccctt ttggatccag gtggaaggag tcccctcttt gtgacactga agtttcccta 170520 
gaaaaaagta ggagtcatat gatggttctt aaagtaatag acatcctttc tgctcacatt 170580 
tccttggcca atgcaggtca catggtcaag cctgatgtca tcagggtggg aggtgtgatc 170640 
ttcctatagg acaggggagc aaatactttc aatctgtcat aaaacctgaa tgaaatagta 170700 
agatatctgt taaatctatg tttcttaaac acacatgact gtctacacag ggttcctttg 170760 
tgcattattt gtaatgttaa tacataaaac atgaaggaga gtgtgtctcr ttctcatgat 170820 
"^"^ ttctctatta tcaaaatgct ggtagtcttc taatgatctt 170880 

tatttgttta tttatttttt agttattgaa taatgataat ggtgtctatt tcagatctac 170940 
aactttctga gttttaaagt catggaggaa aaactataat gaaataaaat ataattttat 171000 
ctaaaaataa ataaatttgg tgttaaatcc aagcatattg ttattaacag gtaatacttt 171060 
tattgatttg ggtgttttag gagttttagg agcatagaaa tatatcatac ctgtgtttac 171120 
lltnlt**? 9 "^""f ta 3aa<=ttcc cagacagcat ttttggagat ggtaacctga 171180 
gatcatggtg tcatggtgtc cttactaatc acttttaata ggatatttaa accccctttc 171240 
cagaaagaaa acaaggagct tccaacttcc catgtacccc atcacaaatc atccttcctc 171300 
tggactgtag taacacatct accccgtcca acacatctac cccatcctct gaggtcattg 171360 
tgttaataat gtaaccactt ctctcatttt tctgcagttt ttctccattc ctgtaagtgt 171420 
ggtgtattag tcctttctca tactgttata aattaatacc tgaaactggg taattcataa 171480 
agaaaggaag tttaattggc ttacgtttct gcaggctgta caggaagcat ggttggggcg 171540 
gcctcaggaa acttatgatc atggcagaag gtgaagggaa agcagacatg tcttacatag 171600 
tgggagcacg aagggtgcgt gaggtgccac acacatttaa acaaccaggt ctcctgagaa 171660 
ctctatcaca agacaacact aaggagatgg tgctaaccat tagaaaccac ccccatgatc 171720 
caatcacttc ccaccaggcc tcatctccaa cactggggat tacaattcaa cgtgagatct 171780 
gtccctacac acagccaaac catatcatat gtcaatgttg ttctctccct gaaaacaaga 171840 
acaaggctgg gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccaaggcagg 171900 
tggatcagga ggtcaggagt tcaagatcag cctgatcaac atggtgaaac cccgtctcta 171960 
ctaaaaatac aaaattagcc aggcatggtg gcacgtacct gtaatcccag ctactcagga 172020 
ggctgaggga agagaatcac ttgaacctgg gaggcggaga ttgcagtgag ctgagatcac 172080 
actactgcat tccagcctag gtggcagagt gagactctgt ctttaaaaaa aaaaaaaaaa 172140 
aaaaaaannn gagcaaaacc ttcccctgat gctctatatc catacatgtt tttggctctg 172200 
tggaccatat ggactctatc acaattactc aactatgtga ttgcaatgaa aacagctatg 172260 
acaatataaa aatgagtggg cataactttt ttcaataaaa ctttattcac agaaacaggt 172320 
ggcccaatgg tggccggtag attgctgact cctctgctat agtctttctg ttaacctacc 172380 
ccttactgtt tcttcaaaca atctgcttaa aaaaagtttc tatttctcat ggtttccaga 172440 
ttcttacctc tgtttcatgc ttgaaaactc tgcaatttga cctttatacc caaagttttc 172500 
tgacagcttg gggcaagagt ggctttctga ctgagtctga agcattatct gtgaattatc 172560 
ttgggagtac atttccgttt ggctctcctc actttagact tcttacccca tggatactct 172620 
£ tC ? ^tgaaactat ccacccttgg ttctcttcca gtatgtgagg tttcttctca 172680 
gcatcttcat cgttaagaat gcatttaact gtaggagatt caaaaacgca acaataacca 172740 

TtaTtTrT/r ^K^"" ^T^** acctaacatc tggagttaag gagttcaagg 172800 
ttggtgcagc cctttggcaa tgtggccaga ggtctaagca ttttcctatt tctgctccgt 172860 
cttccatatc attttggctt ttgttctcat gtttgttgcc ttatgggaac ctcaggtttg 172920 
aaatgccatg gacaatgctt gtgtgtctaa gacaaaaaaa aaaggacaaa attagtttgt 172980 
tttttaaatt ctttattcaa tgaatcagaa aggaaaagtg tccctaagac cctggcagcc 173040 
atgtctcctt aggtatcctt ggcctgaatt atgccagacc tgaactttgt cagttcagtg 173100 
catgagagaa gctgggaagg ttcatatttc tagccagggg cactgcctgc ttctccaaat 173160 
tcggttttca ttaggaagaa agatgagtta aatggatatt ggctaaacaa ttgacatgtc 173220 
tcccagagtg acctgtgcag gatatttctc ctcccataac tcggtgttag tgttctagga 173280 
atttatcttt cgacttcttt ttctgttaaa ctgcatgcaa ttcctgtgca gtctcattca 173340 
ctatcttagc atgacacrta aggtgctgat gctcaaattt ttctaactca gattttagta 173400 
tcctatatcc atctcctaag gggtttcacc ttctggtttc cccacgtgta tttcaaataa 173460 
aagatagtct gaaagtaatg ctgtttcttt aaccggctat ttctcataca tctctacctt 173520 
gattgtaact ataaaaatga taatcaagaa actgaaactc aacttctcct ctcactcctt 173580 
caactgctat actcaatcaa ctgccaagtg ccatttacaa agtttcttaa catctcttga 173640 
ggctattaac gtattttcat acccactgtc attctcatga ctagtttctt ggttaaaacc 173700 
tgtattattt ctcactctgt ctattgaaga ttacattctc acctggttat tctgcttccc 173760 
attaatactg ctatcagaac actctctctg aaatagaaac catttttcta ggttaaagcc 173820 
cttcaacggc ttgtcataat taacagcagg taactcatac agtccctaaa caatcttatt 173880 
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tatcccaggg tattgctgag gacttcttaa tctttcactc agttcagact cactcttaca 173940 
gctttatttg gtttacaaag gatatttacc tgatatctac caggtatctc aagctgcaag 174000 
actgaaatca ttaccttcaa ataacctcca tatctatact ttgtccttcg tgttcccttg 174060 
tttagaagat gttttgtcaa caactccatt cctcctagaa gttttgtagt aatcatccct 174120 
gattttcttt ctcctctgct tcatcactaa ccagcctcct tttttctgat cagccacaac 174180 
atcctattgc ttcaattctc ttaatgcctc tcaaatatac tcactccgct ctagtgtggt 174240 
gattacagcc ttacttcagg ctactgtaat tacttgccta aaatatgacc agattctttt 174300 
aatcttttcc atctgttcgt taaaagaatt tattctagag gttttatttt ctacaatgga 174360 
gctacgatta tattattcac ttcttcaaat aattcaatat cttaccatta ccctctaaat 174420 
gaagtccaaa cttcttacca tagtttagaa agcttttcca gagctggaca catctatcat 174480 
agataatgct gcaggcaggt tgtatctgtg aacctgtcat cacctccctc tttgtactta 174540 
ccacactgag gatgctgggc actgcatggc ttcccctatt accatcatca tggtcctcac 174600 
tgtaactgta attggaagat tacatgtcct aatcctccac cctttcatca catcatgggc 174660 
cttctgatct caaccatgat gttcttccta tgcctggctc tgtgtttggc aaatggtaag 174720 
ggccagtctg tgcccttcca tgcatggatg actgattggg atgtcacctt tgctcctatc 174780 
tctcatccca tcatccccca tcgctgcact tgactcacta gcgttttctc ttccctaaac 174840 
agcagattca gtttgggtct tgaggccttt gcactttgtc ttctctctgt gtgcaagcct 174900 
ctgcttttgt tacactgctt gaaggattgt cacctttcaa aacaggagac ttccgcatgt 174960 
ctgagaaaga gctttaaggg ccgttgcatg gtttggctgt ctcccttttc ccttttgcct 175020 
gagaacgacg tttcaaatag aagctgctct ttcaatctgg tgaaacggca tgaggagaaa 175080 
aaccatattt aacccagagg tgaaatgcaa tgtcagtgag aaaaaagcta ccgtgagcca 175140 
ctgagatttg gggactattc cacagcacac tccaacctag ggaaatctga caaaaacaga 175200 
ccatcatatt tgaagatttg aattcaactt ctctctggtc tcttagcctt ctatcaatgt 175260 
atgctagcag agaatgctac cttcttccca cagtaagaaa agaagaagaa atatcctacc 175320 
ttctcctcag atgtaaatta tgagcttgga tttccttcct gtataagttc tatcagcttc 175380 
aatgcatgca taggtctctt caaaaaataa aataaataaa atagctaaag atccttccta 175440 
aatgctactt atcactccat ttaccatgtc cctagtgcta aacaactgtt tataatcagt 175500 
ttaaaaaaat agttttgttc tctccattcc cattttgtga gctccctcca gcttctcaac 175560 
ctttttcatt ttgtatttca tcctggcaat gcatgtaaac accctttaat gttattagtc 175620 
gtatgtgtgt ccctaaaggc aacagatacc ctttccgttc ccaatttatc agaagactca 175680 
tggatattca actttctgga ccattccaga ttttataaaa tgtgaccatc atttgacttc 175740 
agtaactaca attgcatcat tttgctcttg ttgctttgcc tctgcttcag attgatctct 175800' 
accctaaaat gttgctcctt cacaaattaa catataactg aatatgcaac attttcactt 175860 
ggtgatctca agatacatct caaacttaac atccccaaaa caaaatatca taatttttgc 175920 
ctttacaaat aaaaatctgt cactacaagt tacgtttcag ttttagaaaa tcataagaaa 175980 
ttgaatttta gaatatgcat tctcttgtgt cgggctttga aagacacaat gttttgtgtt 176040 
ttagtgtaat atttttaaga tttgtccaca ttgcatcaga tttgcccttt tttattgctg 176100 
agtcataatt tgtatacaga tataccaact gaatccagtt atataaaatt catctacaat 176160 
tattttgtgg acttatgtat ttatttttct tagggaaatg tttaacagtg gggttttctg 176220 
gtcatgtggt atgcataaat tgaaatacga aatgcagaac tttctctaag cgactattgt 176280 
tagggtctga gattttaacc tacttataag ctaacatgtt agtttattaa tgccttatgg 176340 
atgcaagcag aagacacaaa acttctgggt cagagaaaca ggactttatt atccacagta 176400 
gagcaatcag catgagcatc atgtttgact tggctttttg tatcgtccac gtcccacagg 176460 
gatgacacag agggcccaga tgggtggctg aagattacag gagaggaaga gtgatcttgg 176520 
ggaactcttc attttaagat aactggtgag caaacctaaa ctttgtcttg gataaaaaca 176580 
tttcctcgtc cctcaaagtg gctcattgca atcataaccc tgagaaatgg cccaggttaa 176640 
aaaataataa aaagtaggag agtttgatac tagatacttt tctagatatt ttgaagtatc 176700 
cagaaaacgt gtggtattag gagagaccca tggaggacga tttcacaaca gctgagtaat 176760 
tttacattcc cattagcaac ataggaaaat tatttaattt cagtaattct agtaggtgta 176820 
tattaacttg aatttccttg aagaataatg gtactgaaca tcttttcttt tcttgcttat 176880 
tgagcattca tatgtccttt tttgcaatgt gtatgttcag aattgtttct ttttaaattg 176940 
ggttacatct tcttattttt cagttttaag ggttctttac atattattaa tacactttct 177000 
ttgtcagata tgtgcactgc aaatattttt ctctcagtct gtggcttaca ttttcatggt 177060 
cttagagaaa ccttttgaag aacaaaattt tatcttggtg aaatgtaatc ttttattttt 177120 
tagcacttag tgcttttgta ttctaagaaa tgtttcccca ccccacaagt ctgaaatttt 177180 
tatcttatcc tttcttttac aagtttataa tatgattttt gcttttttgt ctatatttag 177240 
ttttgagtta tttttgtgta tggtgtgagg tggtggtatt taccttttcc aggtggaatc 177300 
aacttgttct accaacattt atttaaaaga ttgtcctatt cctgttcttt tctgattgaa 177360 
ttatgtttgt cctttctccc actgaattct gatatttctc aaaaatcaaa taataatatt 177420 
tgtgcatttc cagttttggg ctctcaataa tgttctttct ttatgccaat atcacactgt 177480 
cttgagtgct gaatatttaa attaagtact ctagccagag gtatttgatg ggttccaatc 177540 
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cattacattt attattatta ttataactat aataataatt attaatggtt gtttacagaa 177600 
ggtaggctag tcctgtagca accattttgc cataactaat agtgtgatat tctgtttact 177660 
tacaatttga tatttttctt cctttagtaa attggagatt gccagaaatt ccttactatt 177720 
tctcccattg aaatagggaa gcaaattcac ctcaccttga atgtgggctc attttattga 177780 
cttcttgtcc aacagaaggt gggacaaaaa aaaaaacatt atgttattgg acttccaaga 177840 
ctaggtcata tgaatccttg aagcttcttt ccagctctct tggaacactt tctcagtgag 177900 
acccgagatg ctatgtaagt ctaacaacca cctcactgct gtactgggga agacacatgg 177960 
aggtgttctg atggacagtt cctgcctagt ccagaattcc agatttcctg ccaagatgct 178020 
aagaaagtga gaatagccct cttggatctt ccagaccagt tcatctgccg ttagtgttag 178080 
tgtgaattct agaaatacca tgttgaaagg aaagctccat gggtaaatac gcttctgcta 178140 
ataagtggga taccagaacg gctgttagca gctgtcaggt ggaagtggtt ctcgtgtcag 178200 
attccagtgg aagttttcag tgtctctttt ccttataggc ttaggatgtt tctctactgc 178260 
tcccagcaga ggtgctcagc atactccaaa gaactctgac tttgatatcg tgccaatcag 178320 
ggttcgaatt cacaaatttc catttataat ctgtataaca ttaaatcctg ttccaatgtt 178380 
tctgtcattc agctttgtca tctctccata cagaaaaata cccacccaat gtcatcccac 178440 
ctctgctgcc taaacactgt attaatgtta agtggcttcc ccatgctatg cactgtctat 178500 
aaaataggga tagtatcacc ttcctctgga tgcattgtga aaagtcaatg tgaccatcca 178560 
cgtaaaaaat ttgatgttag gcacagagga tttgcttaat aaatgttagg tattattcac 178620 
atgaggactt gatgaaaaaa aggtaaaagc atgtatttat cacataaaag ggtcaaataa 178680 
atgccagttt ccttcattct tttcctttta gggcactttg aaaatatgga aaactagtta 178740 
caacacagcc tcaagaagct catgattcac tggttactta tacaagtgat cctaattatg 178800 
gcttgagttg tagttgtttt acaaaatgta aaaattattg agataataat tagttattta 178860 
caaaacaatt ttgctcgtag agattttaaa ttctgtcatt aaaaagaccc acaagcctct 178920 
caattgcaaa gaagttagca tcaccttttt ttttttacca aattcgaaaa aagcaataat 178980 
ctaactacag aataaaccat gggcggtaca gtacaaaaaa tgttcttatg tttttaaaag 179040 
ttgtgcagtc cattgatcag gcaggcagtg gtggggagaa caaagccaaa cactgaatac 179100 
acgttttgct acaaagaacc tagtctaatc agttccattt agaaatgcac tttcatttga 179160 
ctatatttgc aaatcatcat ttttttttct ctaaaaataa tatggagtgt ggaaatgctt 179220 
tcctatgcag aaaggttttt ttatttattt tgctgagttt caggaaatga agcttagttg 179280 
ctaccttgca tagaaacacc aaactataga atgcacaaat ttagcacaga gtctgccacg 179340 
tatggcaatt ttgataccgc tacctacaat tcacatttac tgcatccatc caattacacc 179400 
catggtatca atgcacagca cttagagcag gctggcagtt cacaagagtg caacggatgt 1794 60 
cacgcagtct gcatatcagt caatctgctt cctgatatgc tttaagaaag cctgttcctc 179520 
agactcacct gaaatgcaaa gagagaggtg gaggacgatg gcccatgttt cctcttgtgt 179580 
acctacactg cagttcttgt tgatgaccat ctaatgcaaa ctgtcccaca aatctttcaa 179640 
agagagtaaa ggtgaagcca gtccctatgt taacaaacct caaccttatt tcagcagatg 179700 
gtaaactttg attttgttta agaaccaaag cttcaacagc ttcaggaaaa catgtgtttg 179760 
taattcatct ccattttaat gccacccttg catggacgag ctacatctcc tgctccaggt 179820 
gaatgggttc ttacgttctg gacactgtgt cttcatttca gttattgata cctagagttg 179880 
gtgagtttca agccaatctc taagtcaaga gtttatttgg gagttgggtt tccttttgaa 179940 
atatactccc tggagcctca ggctgttgcc tgttcacctg tttccttttc agaccaaatc 180000 
atggaaagga ccaggttatc tctgatccat ttggggtgtg taaatatcaa gaaggataag 180060 
gtacctttcc tggacttgag aacttatagc ctaataagag aagacaggaa taaaaaaaga 180120 
aaagttttag ttgctatgag actggcatat aaaagtacag ggacaggaca aagaaaaaaa 180180 
tggaaaatta tcagaaatgg gtataaagat ggcttcatag aggaggtaaa ctctgtctga 180240 
gaaattagaa aatgcccaga aagacatcaa agagaaggat aggctgaaag gtggaaggca 180300 
taaatgctgt taaaaaaggc aaagagtcat gaaatagaca atgaatttct agaaactaga 180360 
aaactggaca caatgggaca gggcatggaa tacaattaag tgacctctga tttaatgttc 180420 
aagctagtat caaaggagat gctgtgaatt ctaatcttga aatagcacac ataaaccaga 180480 
gctgccccag gcaaattgtg acatatggtc tccctaggca taaagttaat ggtatagtca 180540 
gaaggttaat aacgtaggaa atgaattgaa atggctgctc tgacacacgc tacaaagttg 180600 
gaacctgatg caatgtggta ggcagaattt tgttcactat gactttcacc cccggtgcca 180660 
gctttgtgat tatgttatat gcatggcaga agggacttgc agttgtaatt aatgttgcta 180720 
ttcacctgac tttaacagag ggaggtgatc ctgggtggtc caggtgggcc cagtgtaatc 180780 
acacgagtct gtactagcag aagcagaagg cagacagatn agggaaagag gcaggacaga 180840 
gatatttgaa gcctgacaag gacttgacat ggtattaata gttttgaaga tggcggggaa 180900 
cataaggaag ggggta.cagg tggcctctaa aagctgatga tgacctctgg ccaacagctg 180960 
ttgacctgga gaagaatgtg tgttaaacaa gtgtggatgg tgatgataat aatgaagatg 181020 
attttgctta tatgctgaga agacatcatg ttaggagtga ttacagataa tgcagcaaaa 181080 
aaggagcaaa gtgttgaggt agaggtgaga gaaggggtga gctcaagagg caggcaggaa 181140 
tccttagtca ggcacagtgg ctatacattc cttttcattt ctccttggac ttgcagatgc 181200 
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gaaacagatt ccttttctca gatgcccatt cattaggggt ggccatgtga ctgaattcca 181260 
gccataaaga tcacatgtgg cattttaagg cagaagcttt tcccttctcc cagcagaatg 181320 
gaaaagctga agcaatagga ggagatgcac catacaatcg aaggaagcag agaccctcaa 181380 
tatggaacaa cgagaaacat ccctgcagta atagttgaac aaaaacacac tcccattgtg 181440 
ttgtgccctt tcttgacatt ttagactgtt cattttcaca gctggaccac tgcaaccatc 181500 
acacatgtga gaagaaggtg ggtctagaag gaagaaagtg agggaagcta caaatattca 181560 
tgtttgttag tggggaccat gaaggcgaga tagatttcaa tttcctcggc cgggcatggt 181620 
ggctcatgcc tgtaattcca gcactttggg aggccaaggc gggcaaatca cctgaggtca 181680 
ggagttcaag accagcctgg tctaacatgg tgaaacccca tctcaactaa aaatacaaaa 181740 
attagccaag tgtggtgatg ggcgcctgta accccagcta cttgggaggc tgaagcagga 181800 
gaatctcttg aacctgggaa gcggaggttg cattgagcca agatcatgtc actacactcc 181860 
aggctgggtg acagagagag actccatctc aagaaaaaaa aaaaaaagat ttctattttc 181920 
tccataaggt agtggaaatg ggatctgctg acctcctgag tgtgtaggag tggaggtgga 181980 
gaatgggata cccaaaagca gaggagctca tggaggacaa gaactgtgag gactggaaga 182040 
gaaagcatgc taggggtagg attgcagaat agagctcaaa cttccaggtc aatgatgagg 182100 
gttcagcagg acttggacac tgggcatttg cagtgactcc atgtctcaat attgtctgta 182160 
tagtgaacag atgctttgaa tgtgggagac tagaataaat gcatttttat aaagattttt 182220 
gattaaggca ttagtaaata gcttcagtag aactgtttac ttctttctac tgcaaagata 182280 
tcccaagggt ttctagcctt catcctcagt tgctttctta atagttacca tgaatatacg 182340 
taaggccatc cataatttca cacctatact tatcatctca aggcaagcct tgttagacag 182400 
gcataagctc ataataaatt acctgatcca gataaggctt ttttttttgt ttccccaatt 182460 
acactcatca accacaaatt aagattaatt acatagatta cttggtctgt aaacatttaa 182520 
ctaccatcaa tggtatgaat gaccatgtta aaggaatttc ctaagtgcaa tagcagagta 182580 
ctaccattcc ttgagtatgg atagtcaata ttttgtaaag ctataaatac gggaatctgt 182640 
aaagaatata ttctgtatgt ggagtccata ccaaatctgt tcaaaatgcc tatgttaaat 182700 
gtaattagta tattctgatt tataatctac catataaact cattaaaaga aacccacaag 182760 
cagctcgcca ttaaaaaaaa aaaactgtta aattgatgga aatctggtac ttatttttta 182820 
ggcattatgc agtgtgcttc attattatac acagattctt ctcactatag tatcgtttct 182880 
gttaaaagct gtgtgtgtca ctcatacagg atacttacag agcgttttta aagcagtaca 182940 
taaatacatg aattatttcc aaatgtgagt taaattgcaa gtcacaattt agttaagtag 183000 
attagtgaat agaaataagt gatttatttt aatagccaaa aaacaagctt aaagcaggtg 183060 
aaaggcatag agcagacaaa ggctaaattt tatggagcat ttgcacattt gcaagatttc 183120 
attaaatttt ctcaacacaa ttttcctctt ctagtaatat gggagactag atattctaac 183180 
tggtctttct actaattaaa aacacctaaa aagtcttggg aaatttgttt tataaatgaa 183240 
tcaatgcacc agaaagaaag taagaattct catttgaaaa ctaaatagag aaataaacct 183300 
tgagaagaaa gaagaattct gaaacaggtt tgaatttgag gatatcaccc aaaaatcttg 183360 
acttgcaaat ttgtttgcac agcctctcag agtgcaggaa acaggaggac atccaggtcc 183420 
ctaaaaagat gtgagttgat cataagtact cttgatgagc tggaatgatg agaaggctac 183480 
actcttagca tagctgtgag ccgaaactaa agacaactcc tttctccagc tgatacaagg 183540 
taaattgtcg ataaacttga ccctgaatgg aaggtaaaca cgtaaataca cagagataca 183600 
aacatacaca caaacctccc atcaaccctt taataagaaa gaaaggaaaa agctagacct 183660 
ccctccaatt tacacttaac gcatggtaga tacagatata gacatggata tatatcacaa 183720 
aaaacccaga gtttatttta taggcaattt acatacggca aaaagaatac aaatcttttt 183780 
ggagaatttc agcttcatta ctaaactcat ctctagagaa tgttactaga aaaatgagca 183840 
ctcaccagca gatttcacta aagaatcaag aaagaaagaa ccatgaggaa gaacttgcaa 183900 
acacaagagg catcagaaag aaggctgttt agtaagaacc cgtaggaatg taatcacaca 183960 
tggaatataa aatctatatt taatatcttt aaaggaatga aagactagtt ttaaaatatg 184020 
atcatggaaa aagaaaacat aagaataaac atgatgattt ttaaaagaag aaccaagtag 184080 
aactttcaga tataaaagaa ccagtggtcc tgggtatgac cacagcaagg acagagctga 184140 
gtgccttagt gaactgaggg acctgaataa attacccgga aggcagtgca gacaagcaag 184200 
gaaataagtg caatgaagga caacctagca caggtggaag aaagaatgtg aaggatgaac 184260 
atatctctca ctggagttct gaaaacagaa gtggcaggaa aatataaaga ttgtaactga 184320 
ggattattga tgactgatga atgatgacaa ttaatacatt catgaaaaca tataaatact 184380 
aaccaggata gttaaattaa aataaaacac tcacacttta acacacctta tgataaaacg 184440 
ggagaacacc aaagtaaaaa gagcattgta aaaatagtca gcaagaaaag acagattaaa 184500 
tcccagagag tgacagttat atcaactgct aactttccaa gagcaaaagt gacattagca 184560 
gaaagtagag tgatgtttgt tctgtattgt tctgaaagaa aataactgcc aacctacaac 184 620 
actattgcct gtaaaactta caagaggccg ggcgcggcgg ctcacgcctg taatcccagc 184 680 
actttgggag gccgaggcgg gcggatcacg aggtcaggag atcgagacca tcctggctaa 184740 
cacagtgaaa ccccgtctct actaaaaaac acaaaaaaat tagccgggcg tggtggcagg 184800 
cgcctgtagt cccagctacg cgggaggctg aggcaggaga atggcgtgaa cccgggaggc 184860 
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tlnntltl^ S^agccgag atcgcgccac tgcactccag cctgggcgac agagcgagac 184920 
5S??t«« aa'c^T 3 — aactta caagacagga gaaacccaga 184980^ 

JllllTxnn t a f caa l cc tgattaaatt aaacccaaaa taggagtaga gaaataagaa 185040 
cttagagagg actaaaatgg aagtggggag ttgagagcaa aaatctgttt acatggg?aa 185100 
ataaaaatca aaataataca aaaataatca tttatcttgg gataaaLaa aactlaacaa 185160 
gatagactaa attatgccat atcgatacaa tatagcctgg ggaaaatgtg aagataaatt 185220 
tcctaaattc cctgtattgt ttagggagaa ttaccagttt ttaaattat? ttLaactta 185280 
ggtaagttaa ctgtaataaa aaaattcctt gggtaaccga gtagttttta aggaactgaa 185340 
tataatgaaa atacaacata taattcccaa aaaagtaatc acaaattaat gagaaaaata 185400 
a ^ aa ^ a '° tgaatctata agtaagaaca tcatataaag gtgggataaa tagaaataaa 185460 
aactaatatt ctaaaaataa gtttagtatg tgaataatta gaatatgtat aaattaaata 185520 
aatactccag gtaagaaatc atcgatatac agcctccagg gtcttagaag gaaacaaaca 185580 
gcacacttaa ctaggacaac ctggtgacag tacaatagag aaaatatttt taaaagtata 185640 
gatataccct gttttataac aataagaaga ttctcagggg agggtataca aaatatacca 185700 
catgcactgc caatgaaagt gtaaactgtg taaactttct gaaaacaata caccccaaaa 185760 
ata^r 3333 ^ taata r C " 39gtCcta agaaaagtga cactaagaaa atatgtatct 185820 
tTr-tl * 9 " attattCC atgatttata attagcataa atgtagcaac tacttaaatt 185880 
cltlTnrtt T^t^ atttaaaaaa ttcattaata gcatggggca atactcattt 185940 
rttnnllZ atgaat <; aa 9 aataaaattc aaagaactcc aactgattaa atagactccc 186000 
tlttlttr.J agtatC ^ aa ^aaacccaa aaaactagtt cagaccacaa taggaagcgg 18 6060 
??" aaaCat g ^ ctcctta t acccttccct ttggagttca ggcatacaac tgactgacat 186120 
: aa : a " aaa a t a 9agatca gaagactgac aacacagact ttttgtaaca agaagatacc 186180 
aaattctaac ctgagtctgg tatagcatca catgacagat agcagcaggc tttgaaggaa 186240 
SaactattL ttctttg.ca tattttgaaa tgaLctJaa S 

aagctgtttt ttgtagagga aatttgcagc ccgtaaataa tctccttccc ttactagata 186360 
ttttctagag tctgacatcc tttaaggtat gataggaaat acatatgatc tattgtctct 186420 
gaagcctgct acctggagac ttcatccaca taacaagaac cttggtttcc ataatccccc 186480 
tittSltlt ttttS^ "ttcttttc ttttcttttt tgaatttcag gctaagttta 186540 
allxUtt, "tttttatt ccaataggtt tttggggaac agctggtgta tggttacatg 186600 
aataagttct ttagaggtga tttctgagat tttggtacac ctatcacctg agcagtgtac 186660 
actgtaccca gtgtgtagtc ttttatccct caccaccccc tattctttcc cccaagtccc 186720 
caaagtccaa tgtatcattc ttatgctttt gcatcctcat agcttagctc tgacacatga 186780 
gtgagaacat atgatatttg tttggttttc tatgactgaa ttacttcact tagaataaL 186840 
f^ttlt I CCatCcaagt tgttgtgaat gccattattt cattcttttt tatggctgag 186900 
tagtatatat gccacaaata tatacatata ctaaatatac atactatata tatatataaa 186960 
tatagtatat aaatatatct aaatatagta tataaatata taaatataat atataaatac 187020 
atactatata ttatatatac aaacatatat acacatatag tataatatat atactatata 187080 
tattatatat atacacacat atatatatac acaccacatt ttctttatcc acttgttcat 187140 
S"' 'I? 9 ? 9 "??* tccatatttt tgcaattgtg aattgtgctg ctagaaacat 187200 
M gtatcttttt tgtataatga cttattttct tctgggtata tacatagtag 187260 
Sactfat T^tt 9 gtagttctac ttttagttgt gtaaggaatc tccataatgg 187320 
► ►►^f accaacactg "cccttttc actgcatcca cgccaacatc 187380 
^aaat™' ttttl \ tl * 9 "\ tggCCa "cttgcagg agtgaggtgg tatcgcattg 187440 
^tlll?* 1 - 1?+?? ctgataatta gatgttgagc atttttccgt atgcttgtgg 187500 
3 tattttcttt agagaattgt ctatacctcc ccttaggctc cacttccaa? 187560 
attgggggtc acatttcaac atgaaatttg gagtttggaa acatgaagtt tggaaatatc 187620 
aag = aggatt atatagatac aataaagttc aatgtatttt tcaaggcact 187680 
tatltltlll a tTt? ? atttatatta tttatatata aaaataatat attatttata 187740 
lt ta lt , atata «ata tatataataa tatatataat atataatata taatatataa 187800 
tataataata ataaatatta tttatatata taaatttata tatataaatt taatatgcta 187860 
tataaatata aatatataaa taaagtatac tccacaattg cctaatacag atttctttta 187920 
caaatccata ggacacctat aaaattgaca gatataaaat gtgtttccat ggtatttaaa 187980 
agatttatgt tatgcagata actgcctctg aaagcagaaa cattaatttg caaataaata 188040 
acaaaaggac aactagttta cagatcaatg aatgccctct taaaaaagtc ttgggtcaaa 188100 
gaagaaccca aaatagatat tagaaaatat ttttaaaaaa tagataatga aaatatatat 188160 
taagcctagt aacttacagt cagacagaaa tggatttaat gcagatatta gaaaagaaaa 188220 
tactaaaatg aataagcttg gcatgtattg aagaagttat aacaagacaa ggaagaaaac 188280 
ccaaagacag aagtaaatct catggtgaac tgtagttccc ataatcccca caactagtgg 188340 
gagggacctg gtgggaattg aatcatgggg gcaattacct ccttgctgtt ctcatgatag 188400 
tgagtgagtt cttgtgggat ctgatggttt tataaccttt gctctgcact tctctttgct 188460 
gctgccatgt gaagaaggac atttttgttt cctcctcctc catgatcgta agtttcctga 188520 
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ggcctcctca gtcatgctga actgtgagtc agttaaacct cagcctcagg aatgccttta 188580 
ctagcagcgt gcgaatgcac taatacatcc catatctcct taaaatgtac aaaaccaagc 188640 
tttgcgctga ctaccttggg cacatgtcat caggacctcc tgaggctgtg tcatgggtac 188700 
atccttaacc tttgcaaaat aaactttcta aattgattga gacttgtctc agatactttc 188760 
tgttttacaa attacctttt gttctagacc actcacaaga gtgaccatgc aaaatggaag 188820 
tgtgccactt actcagctag aaacgctcct tagcttttcc tctcactggg aatgaaagtc 188880 
aatattctta ccttgacctg tcaggccctg tacttaagaa aagaacagca tgctttgacc 188940 
taaaattacc tctgaatggt ctttgtctgt aaaatggaag agacatggag aggcgccggc 189000 
acccgaggtg ctttctaaaa taaggggtgc cttaaaagtc gtgttgaagt ttgtcttcct 189060 
ggaataggaa gtgggtggaa tattttaatt agggaggtga tgtgattgaa tttgtttata 189120 
aaattatttt tggttaaatg taatagagaa ggaatttgat acagtgggta ataatgaaga 189180 
ggaaacaatt ttaggaatat taaaatatcc caggcttgaa tgaacgtaga gatggagaaa 189240 
tgtgtaaaga ctgaagaggc tgttgagagt tagatggaaa agacaagtgg acggattagt 189300 
gttgggataa gaaaggaaga aaggagtcat tgatggggcc atgtttcaga aggtttaggg 189360 
gtgaaggctg ctgcaattca ttgagaccag agcctttgaa agatagataa gcatgcttgg 189420 
gtgcagtggt tcacatgtga aatgtcctac gaaacataaa agtgaaagga ggagacaaat 189480 
ataataaaaa atatatattc ttgatgtcaa aactaaaata taaatatttt ggagactttt 189540 
gtgtgtattt ggatgctatt ctcatgctct tttttctgga aaaaataata caaaacaaaa 189600 
attatttcac gagttctcaa tttttgtcct tctgttttta tcagataaat gctaataact 189660 
ttcatatctt tcttctcctg cagtttccca tgtcttccaa atttatttca catcattatc 189720 
ttacttttcc tcaaggccgt tttttctctt gactacagca atgcagattt caattctgcc 189780 
attgaattcc cagacatatt cgtcatcccc attttcatcc cccaccaccc tgccattttc 189840 
ttcgtgttaa cttgttttcc tgactcacag aaatcacctt ttcctgtata catttttagg 189900 
atgtcagact ttattctaat gatttctcct agttgccccc caaaattgta ttctacrgtg 189960 
tgattttaaa gctgaatttt caagatgata tttcatatct atattttcac aagcttttct 190020 
tctatgaatg ttattgtcag ctgtcagggt gtgagatggt acttgatact acattctttc 190080 
caagctgttg cctgaatcgg tttaagacaa agtcattact aggctgtaaa ctgttgctct 190140 
gcaaaattga gcagcacgta tttaaccact catacttctt agctctccaa cactttgagt 190200 
caatagagta tacaaacacc tgtaatttcc agtgagcaca gctcccaggc tctctctctg 190260 
tctctctctc tctcttttgt tttttttttt ttgcctcagc ctctcgggct ttatttgtgg 190320 
gatttgcacc ttcaccttct gcatgcagtg tgccatggct agttcccctt acatttttgt 190380 
catgattttg gtagaggcaa tgatttcctg gttgtagaaa aaagggcagc aacataggat 190440 
tacaaaagta ttgctaagaa attgcaacaa gctacttgca gtttatgtat gattactcac 190500 
agtttttgga tggtgcaacc cttaatttct gaaccagcag gtgaaagatg agagaagaat 190560 
aagagacctt tctatagatc ttcaggcaga caggcttgtt ggcttcctaa tatttcggtg 190620 
ggatactggg atgggttgga ggcaacatac tgtttaaata agtagacagt tttttaaaaa 190680 
taatgaattg tgtctctaag atatattttt accagaatga tactatatag gaatatgcgt 190740 
aaacccctgg actttgatct caagcaaaac cttttcccta ttatccctcc ctcttaagtg 190800 
catgagtttg gtaaagtact acagaaattt gttctctcac agttctggaa actagaagtc 190860 
tgcagtcaat gtgttggtgg ggttggttcc ctctggagaa tctgttctgt gctcctgcct 190920 
taactttagg gcattgccgg caacccttga cttgtagatg catcacttca atctctgcct 190980 
gctccacaag gtgttgtcca ctgtgtctgt gtccagtgtc tttgtccagt gtctgctttt 191040 
tcttctaata aggacattag ctatatgcac tcaccctaat taagtatgat ctcattataa 191100 
cttgattgta tctgcaaaca ccccatttcc aaataaagtc acttttccta gggttaggat 191160 
ttgttttttt tttttttttg gtggagggag aaaggaagga agtgggcaca gttcaattca 191220 
taaggaacca actaaatata aatgccactg tggtaagatc agcttgacca tgtagatttt 191280 
taaatggtgg ttttctttgt ttctattgcg ttttttgctc atcttatcag aaaatttgat 191340 
aactttagat aataaatatg aaggagtaaa tcattaaaaa tttaaagtga aggccaggcg 191400 
tggcggctca cacctgtatt cacagcactt tgggaggccg aggcgggcgg atcacctgag 1914 60 
atcaggagtt tgagaccagc ctggccaaca cgatgaaact ctatctctac taaaaataca 191520 
aaaattagcc gggtgtggtg acaggcacct gtaataccag ctactcggga ggctgaggca 191580 
ggagaatcac ttgaacctgg gagatggagg ttgcagtgag ccgagatcat gccactgtac 191640 
ttcagcctgg acaacaaagc aagactctgt gtcaaaaaaa taaaaataaa aataaaaata 191700 
aaaaaataaa gtgaatccca cagatttgct tatattcttt ataactagat taagtgtcta 191760 
caacagggct ctttttgact aactttatgg ggatatacaa caggtgccaa ccactccaca 191820 
ttccaaaaga aggcaaagac ttgctttgca ggacttgcaa agaggtgtcc aaaatgtttt 191880 
cctggcctta ggctcagcat ccccccgaag gttatgaagg aagagctgtg gtctcccctt 191940 
ggagagattg gatggatgct gagtgctcat cttcttcccc ttccctgaca gacaacttga 192000 
tgtgcatccc tggacttaca gggacaaaga agtgtccgat ttattccaga aaatatttaa 192060 
aggatgagag gcttgcaaag aggcccagag caaaatcaaa gaaggctata gcctgaaaat 192120 
aattttattc ccccgatagc aggccagatt gcatctttct tatcactctc tggtggacat 192180 
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tgtgagacag agcttagcca gggattatct tagatcttgt caaagaaggc cctggccaat 192240 
gaggcgatct ctgcagaaag cagctggaag tgcatcatcc cagatcccag gttgagagac 192300 
cagcctcagg agaacagctg aggttgaggg attcattcta cctgggaaaa ctgcatgtga 192360 
tggataacca ttcgtgggca gggatctcaa agaaccccca agtatgccct gaagaattca 192420 
gcacccaacc tcctattact ttcaaaggaa tcaaacaacg gtttccaagg gggcatgcca 192480 
ataactttac tagtgccgtg ctctcccttt ctctgtaaca cctctcactt cagtcctccg 192540 
tgagccatag tgtttcaagg gtgtgtgatg aagcgatgga agcctcaggt ctggaaatag 192600 
gaagaaagtt tactacaagc ggcctgctga ttaaagtggg cccaaacttg tgaagaaaac 192660 
ttggatcaaa gttggcatct tgattcttat ggagtacaca gaattgcaac caataaaaga 192720 
gacaattctt gagaggaagg gtgattagag gacattttgt tacctgggag tgaacagata 192780 
ctttatgaga aatgctctat atttgattaa gtaacaggga tttgcctctc aaagtggggg 192840 
tggttttaag taggcagtct gtgggggcgg ggtaagaaaa tggatttctg attttatctt 192900 
tgctcatttt ccaggttacg taaatattaa gaacattgtt aatttacttt aataaagaag 192960 
ttcataaata gtccaatatt aaaagttgat taaataatac ttttggaaat acatgaattt 193020 
tactgaatgc cctctagcct tactacaaaa agaatgcttt gcaaataata aaagctaaat 193080 
aaaaataagc cagctccagc cactttaagt gacggaacac actctgtaac ttggggatgg 193140 
ggacagggca ggaaatgaag aaattaaagg caagggcgcc tcactgaggt atatcaaatc 193200 
ctattagacc aggtcatttc cttggctggc aactggacag ggctcctatt gtcttgtcta 193260 
ctttggctga aatcttttat ttatcctcca catttgtttc atttgcaggt aattctattc 193320 
ctgtttctac atttaccatg ctggatcagt tttttccctg attttatttt taagttacta 193380 
tgatattcgt actaaatatg taaatgttta ttattgggtt ttagctgttt ctataataaa 193440 
gagttgtagt tattgtaagt tgcatcgcac taagctttaa aaatgtattt ctccactgta 193500 
gagaagtatg ttccattgaa atttgattgc acacaaagct tgacatgagg tttattctct 193560 
ttgtgaaata aaccattcaa tctggactgc atccacattc atctgaaact tgaaggaaat 193620 
aagtcatctt tcacttatgt tttaatatgc catatttcta ttaaattaat ggaaggttag 193680 
acctaaagct caatgattgg acacttatta ttaatgattc tttgcacata gagtgcatgg 193740 
atagctcaca agaaaatttc accaaagtca tcttatatat ttctcataat aatcctaaga 193800 
ggtagggttg atagacatta gcattctgac atcagaattg gagacagatt cttctgtctc 193860 
atttaaggga tttgccccca aattaatcag tccatgataa agccagctct ataatattac 193920 
tctaaagaat attcaaaagg tcccgggcac agtggctcac gcctgggagg ccgaggcagg 193980 
tggatcatct gaggtcagga gttcaagacc agcctagcca aaatggtgaa acctcgtctc 194040 
tactaaaaat acaaaaaaaa aaaaaaaaaa tagctaggag ttgtggtggg tgcctgtaat 194100 
tccagctact agggaggctg aggcaggaga atcgcttgaa cccgggaggc ggaggttgca 194160 ■ 
gcgagccaag attgcaccac tgcactccag cctgggcgac agagcgagac tctgtatcaa 194220 
aacaaaaaca aacaaaaaga atattcaaaa aggtgcatgg ggcaaggaga ggtcatttca 194280 
catcctcttg aggaacacag gtgactccag accacaattc tcctagatgg ccgggggccc 194340 
ctgctcccct ccttttggtt ctagggactt gattcctgaa gaggggtcat gcgccaacat 194400 
ctagtttaga gtttttctat tacaccacct ggtctcttat attaatttgt ttccctgatn 194460 
ttttttttta atctcctgga tcacaatata atcgtttgaa tttttgttgt tgtgtataca 194520 
agggctccca agaaatgtgt aataaaagtg gttggtggga aaatcttgct aacagaatcc 194580 
agtagattaa taactaagaa aataaggcac acacaacaaa aggttgtgca tggctgcaag 194640 
ttcaagagtg ggaactgatt agagtggtag aataaataca tgtgcatgtg ttctgaatcg 194700 
attagacata taaaaagata tctagattct ttgaaaaatt aataagaaac tttttttcta 194760 
agcataaata tgaagttcaa gtataaactt accaaacaac agagaagtga taaagtgcta 194820 
gcataaaaaa ttagcatcat gattattctg tattgtttac atagatctat caagtcaaat 194880 
gttgggtcag agccattctt tattcagtta tcaaaatggt agagccattt ggatgagttc 194940 
agagacttct tctttatgat aaaaaagata attgtaaatc aaaaaagagt ttaaaggcta 195000 
tactaattaa acagctaatt ggaaattgga agatttgttt ctccaaactt ctactatttt 195060 
ttattttctt ttctgtttgt ggaagttttg gaaagcaaca ttttattctg tgattagcat 195120 
caaagagttt caggcatggg ttataggact ggaccatgca tacttgaagt gtacttcaaa 195180 
ccataacatt ctatgagatt tcgtgagcaa ctacctctta ttgataaaaa gaattgctgt 195240 
ttaaacccta ttttaaagta cttacttttt gctttctcaa atataaggat gatgtactat 195300 
ggattagtaa tatcattacc tatggaaaca aaagatataa aaatttactg ttcactcact 195360 
tctccttaat ttccaagaac attaatcttg aatttaagct taaacttact ttaaacatta 195420 
tctttattat cttattttct atgaagaaaa taatccaacc tatctactag tatgccactt 195480 
taaaaacatt aaaagccagt ggataactgc aaaagaaggg cttttaattt tatataaaat 195540 
aataaaataa tttattctat tgataattcc aaactctcaa ataatatctt gctgcagtgc 195600 
taaggaaatg gacaaaggaa gaaaaaacag cttttcatgc ctttcctgaa acaattgcag 195660 
gaaatttttg tcacgctaca attattcttt gatcatgaaa aattctttta aagacacgat 195720 
agccacaacc attttatctt ctgacttact aagcagtctg agggtgaata gagagtttcg 195780 
attccaggtt atggttataa ttactaactg caaaaatcct attgcctttc attaagtagt 195840 
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taaaatattt aattgagact acagtgtgtc tgcacacagc gttgtaccat gcataattat 195900 
ccctacacat acaggaacac tgagggtact tttaacaata aaacagatca tcaaaaaatt 195960 
acatttttaa tttaagtttt gactttaatc ttctccgtga ttaaatacct atctctgcat 196020 
tttttctgca aatgaggaca ttaagctaaa acaaatgaac cataaagtga cgggaaaaaa 196080 
aactttgcac tgttaatata aatggataaa tctttaatat gtattaacca ttgcactatg 196140 
tctttatttt ctaacagctt gtatgagaaa gaagaactgg tgatgcagaa agtcaaatac 196200 
cgccagctta gcgggtcagc atcctctcct cccacataag cagagatcgg agcatgagat 196260 
aaaatacctg gatgcccttt gctccagcct cgtgctccca caactttgct tctgacagag 196320 
tacttatctc tctgaccact gggaaactgc taaaagagag aaacttgaaa aggattggac 196380 
ttttctgtgc aaatctcttt gaaaattgcc acctccaggt tagcttttaa gagaagccca 196440 
atatattagg aacctgaaaa taaaggaaag tttgcacatt gaaacatcac ttgctaatag 196500 
ttctattatt ggaaacggaa atatggaatt ctagaaaaag aaatgtcatt atggggttag 196560 
gaattaacca cttgctactt atgttaatgt gacgtggttt aaaaaagctg caggaagttc 196620 
ctgaaattat tgaaacaaaa ataaagtatt ttcccctgga atctaagtaa tgtaagtgaa 196680 
attctttaaa cattctcttc aactttatat ttcaaaattg cttcacagta atttcttatg 196740 
ttatccatca cttgtctaac tggaaggcaa gcatttttat tgttttagtt actatttagc 196800 
tttatgaaca gcattcttgc tttgttgcaa ttacaaacat ttatttccat agggatggaa 196860 
agattatgct ttgctctctc tgccaggtac caattgtcaa tagaaaatag agccaactag 196920 
agtaacttta gctcttcaat caacacatcc agtattaaga aaaaaaatac attgatattt 196980 
gagacgtaaa ttcaaatctt tattcaatac gcaagaagtg taattgtggg agattaggtt 197040 
gggaaataat aaaaagctgt ataattagag tgtcaggatt tacaccctcc actggaagct 197100 
actatatcag gtataactcc gcaaaattgt agatggttgg caactaatcc ttgctcacat 197160 
ttratgatct attactatgt gccaggtgtt ctgtctgaaa cagcatggga agaggaatca 197220 
gagacagaat tgccacctat gtggggcctt gctggggtgc agaaattttg tttctcacat 197280 
ttttgaaatg taatactttg atctgaattt tactcctgct taaggacacc aagatgaaag 197340 
cggatggcac cctgcccccc accctggtgt agtttgttaa gaaatgactt tagttctatg 197400 
aaagtggatt cttagttata aacataaaga actatcaccg gctagtaaga aaggatttta 1974 60 
ttgaaaagat gttgggagtt catttatctt cagggattct gcagatgggc agagagacta 197520 
gaaaaccaag agcacgaacc aggcataatt caaacttcac aacagtgagg gtaccagaac 197580 
agcttctagg gaacaacgcc aggcacgact gcctggaagt tgctggttac gggtgcttga 197640 
ttgtcactat ttcctggcat tgccagctgc agaggcagct aactaacaga gccaggggtg 197700 
catccggggc tctctttatg ggggcatcca cagtgggagg tagtgggttt tgatgcagga 197760 
aattactgtg ttgtatcttg ggttttctca atttatacaa caaatgtata catacgttca 197820 
gatggaagta agaagtacca catctgagag cggaagaggc actgtaatca cccatcccaa 197880 
agaagatgat ttcttaggaa aagaagtgag tttgggaaat atattagatg tattctcctg 197940 
tctcttccat attcttggga atttgagcca tcacttatct ctgatgtgcc ttgcttgaga 198000 
agtggtttct atactatatt tgatgatgtg gatctgataa cacaatattc aaatattttc 198060 
cagctgtgtt aactgagtct ctagaggcag atctcctgag aattgtcagt tgtgtttatt 198120 
ttccagttaa gtctactttc agagagccat tattgaaaga gaaaagtttc aacagaaaga 198180 
tgattattgt taatattatc gtggtaagac cacttaatgt gagatctatc ctgttagcac 198240 
atttttacac acccactgtt gtgttgttaa ctataggtgc aatgttgcac agcagacttc 198300 
aagagcttat tcatcttcca taataaaact ctgtaccatt actgttgaac agcaacttgc 198360 
catttctccc tcctttcacc tctgcaacag aatgataaac ctggagagca ttatgctaaa 198420 
taaaataagc cagttacaga aggacacata ttgcatgatt ccacttatat gaagtatcta 198480 
aaataataac gcttattcaa atacacggtg ggatttttac tgggaaaaac gctcgctctt 198540 
tttaaaaaaa aattgagaga gatttgcttt agttgatcaa aattataagc taccggaaac 198600 
tccccaagat ggcatattgt tgaagccata gtgtgtgcac aggtgttgaa taggcccttc 198660 
agagataaga agagtaaagt gtggtataaa aaagagaaaa agtgctagaa tgcagtcatt 198720 
gcaacattaa aagaaatgct ttatcttggg ggagatcctt tggccaaact atttcaactt 198780 
ggaaagatgt agtaatataa aaagatttca atacaaggaa ataaagaaat tgcatccgtg 198840 
ctacttctca aaagagtaag aatttaactg ataaggttaa ttattataaa attaactttt 198900 
accatctaca ttaaaattag gaattttctg agaatttgta ttctttttgt cctttcaata 198960 
aagaactcaa tatgtttaac tctattttaa acaggaatgg ctgcctatgt cttgcagttt 199020 
gaatgcaaga ccctcaaatg ctcaggaatt atcaagtaaa gtttatgtta agaaactagc 199080 
attctgttga atcactcaat aatggttatt atttaggata tgcaaaagtt aatcacaaaa 199140 
acttagactg aaagatactg cttacaatta taggtgggaa ataccataaa atacaacctt 199200 
tatttatttt ggaactcgtt ttaatgcaga ataatcctag cgctctattt agtaacaaga 199260 
agaaaccaat ttaaatatca tgctgattta aatagtataa ataataaaaa tgacaatttt 199320 
ttgttaattt tcagactcta catagctatt tttgtctctt ctattcttat gaaatattca 199380 
gggaagatat tattactttc actttacaaa tgggataact gagaactcaa taaattgtat 199440 
aatttatctc aagacacaga gctggttcat ggtagagcta tcatcagaaa tctcacgatt 199500 
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attagttatc cctttccaat tctgaagtcc tctgggaagt gtcagaattt acactcctca 199560 
ctaggaacta ctattgcagt tggaactctg aggctttatt tgggagtttt tcctgattac 199620 
agaaaatctc ttcttaagag aggttttaat aaacaaaata agtcggggat attaccagat 199680 
agcccctata tcctcagagc tcccagcatc acacctttta taagaacagt atagaaaaac 199740 
aaagttcctt ccctcctcct gccctgtgtt atttttccaa aaaacaaaca aacaaaaacc 199800 
taactacaca ctaagtgtga aagtgattgg agtgggggaa ggcaagctgg ggctgtacca 199860 
ggtcaagggt tgaaaagaat taggggatta ttacagttct agttcgaaga caaacacttt 199920 
tcaatgtatg tcaatgaact ttattgaaca gccgccatat actagtaacc ctcttaggtg 199980 
tgttgtacac ttacacatcg aaaccagggt ttcccagatt tcttatgatg aaacagatag 200040 
aaggtgataa tgctggcaca gcacacatgg acagtgtgac ggaaggagac caattaagga 200100 
cgangcgggt ccgtatttct acacacctgc caccagctgg acggctgtgc tggaagtcta 200160 
agccctggat gcttagctgt gtgatctgtg aaccggcagc atcagcgttc ccagggtgct 200220 
ggttagaaat acacctctca ggcccaccca gaccttctga atcagaatct gctgtttcca 200280 
aaattcccaa gtgattctca tgcaccttaa agctttagaa acactgttct aattagtgtt 200340 
cttaaaatat tgctatggga tacaggcttt agtaaattct caggatattc taccatgtac 200400 
tcaagtttga gaaacaggag atttaataaa tactgatgtt tggatcccat ccatagagat 200460 
tctgttttga ttggtgtagg gtgtaatctg gcattaacat ttttaaattg cccctggtga 200520 
gtttcatgaa tggcaatgtt tgggaacacc tgctttaagt taagccatga gctaattgtt 200580 
cttcatcata ttgcagatgt gggaactgag acgttataag gaaatccaat ttgcgagtgg 200640 
tagagccaga cttaagctta gacccttaac tcacttattc tcaaaactgg tctgtgctgt 200700 
ttattatttt acaggagagt tctatattgc tatgtttaat acattagtat catttaaata 200760 
tattttaata tatgcccrtg atggataata ctgagtgtcc acttgattgg attgaaggat 200820 
gcaaagtatt gatgctgttg tgtctttgaa ggtgttgcca aaggagattt gagtcagtgg 200880 
gctggaatgg cagacacacc ctttatctgg gtgggcagca actaaatcag ctaccagctc 200940 
ggccgggata taaagaaggc agaaaatcat gatggttaag caaactaaat atggcctgag 201000 
aaggactctg tacttctata ggtgagtcct tgtggatgaa ctgtaaccta gcttaatagt 201060 
cagacaaaat ttaaaaccta atttaatagt atgcactgta acaatggctg aatgttgccc 201120 
agtcccagtg ggattcaacc actcatagac tgcttttgtt caaactgcgt tcaaataagg 201180 
caaacgccga gctggaacca atctcactgt ttctgtacct cgcttctaat tcctgtacgt 201240 
cactttacct tttttgtcta taaatttatt ctgaccacaa ggcacccctg gagcctctgt 201300 
gagtctgctg tggttctggg gactgccgga ttcgcgaatc gttcatttct cagttaaact 201360 
cctttaaatt aattcagctg aagtttttct tttatcaatg aaaagactag actggcttag 201420 
ccttccagcc tgcacctttc tcccatgctg gatgcttcct gcccttgaat atcagactcc 201480 
aagttcttca gttttgggac tcagactggc ttccttgctc ctcagcttgc agatgcccta 201540 
ttgtggggtc ttgtgttcgt gtgagttaat acttaatacc ctctccttta tatatacatc 201600 
ctattagttc tgtccctcta gagaaccctg actaatacaa tgtctccatt gaaaagaaaa 201660 
gaaaatctat taataaactg atatagtcag agcccctcaa gaaaaatacc aaagaaactc 201720 
caatgttgta gagaattatt ttattgctct aaagacatca aaataattac atgccttgtt 201780 
tacgatgctt ttaagttgat cacattcaaa gatgtgtcct atccattcat gtacatacat 201840 
tcaatttcat attgactctt gcctttttca gcagtgatca gtaaaaatta tctaaaaatc 201900 
aatcctgaag gttatacttt atttttccat tttgaacctt gcccttgaaa gatcatttat 201960 
atattttaaa cacattttta aaaatgaatc tccagaagtc taatttctca tgctaaccaa 202020 
attagagagg ttaattcgat gacgaactct tagaaacaat catcattaga acttgggtat 202080 
tatttcaaac gcatctctct tccaaatctt agcagttgaa acaatttcac aggttaaacc 202140 
agccagcttg acagcccgtg aatggctact cactaaaggg tccaagcaga ggcaaaatga 202200 
tggtcctatc tcttaaagac agagcaacag aatatttgga attaaataga actggactta 202260 
aatctcatcc cagtggacaa ctgggaaaac ttggaaacat taatcctgct gaatatttgt 202320 
gcatcattta ataactaata catactgaat taagtagtta gcaggattta caacactgtg 202380 
tataaggagc ctggcatgtg tttcctgatc tttggtgttt ttagagttaa tattatgaca 202440 
gtgtgtaaac ttggtgtcga gaattaaaat ttgcttaaca gttctagtgt tctgtacact 202500 
gttacctatg ttcatcgcct ccaactcatt gaggctcatg atgctaaggt gaattctgga 202560 
ttgaaatcta ggtctgtctg cttatagggc ctttttccaa cttcagggtt tcctgctttt 202620 
gctcaattga tcctttcaag gctccatcac rtttgagcta cagcaaatca tggtgtccrc 202680 
tgttcactga tgggctatgt tccagaagtt gcaagaaaag taaaggggga taaaagaaga 202740 
gagagagaga agacccataa atattctgaa gtatatgact aatctagtat aatggaagct 202800 
cagaaaataa attgtatttc caaatatata tttctgaatt ttttggtatt tgttattggt 202860 
attctctgaa gggccacttt cttttctttc taatgctcta tgtccttaca aaggaatgac 202920 
tattaattca ctggaataag aatcacacta attggcacac agtcccaatc tgcttggcat 202980 
gtcatttaat gtgcagccat ttacctggtc aagtgaatac ttgaaaatta aacagttttc 203040 
ttgtgctatg attgatggca atgcagtcta gggttaagtc ttaaacgtaa tcccttatag 203100 
gatttcctag gttctccctc agagtttcta actccataga tttgggggtg ggtcccagaa 203160 
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tttccatttc taacacgttc ccaggcgatg ctgaagcttc tggtccaggt accacacttt 203220 
gaaaactgct gttttaatgc ttacatgggg tagcatgagt tagttttgcc tggggctttt 203280 
aattccaagt tctgctactt aacagctgtg ttataatgat taagttatgt gatgtctctg 203340 
tagcttgatt tctttgtata atgaggaaaa taatcatwcc cttttacata agattattgt 203400 
gagaatttaa taaacaagaa atttcttata atgcctggta catagtaatc actcaataag 203460 
ttatgtcaat tgtcaaagca catgtctgaa ttatagatga gagagaaata ttgtgaattc 203520 
tatggcactt taatggcctg ttctctctac ccatctctag ccgtacctta cactggctta 203580 
tcctgtgctc ttcctttcct gagggaaccc aaccacatat actgcctaaa gcactctttt 203640 
ggcacttcta acataggtgc aaggaggaaa tttcaagaaa ttatgacaaa ctacaagact 203700 
caaatctcct tttaaatttt aaaaatcaaa acatgtattc atatcccatg gcacatatgg 203760 
atgaagatat tgtttttctc agttaatatt ttagttttgc ttttgtgcca gagccttcct 203820 
ttttttttgt aaatgtccct tatcaggcta ctgcagaatt gcacaaatac tgaattgccc 203880 
caaaatattt gcctttaatt tgtttcttta aagcaaatgt aaaactacaa tgggttatat 203940 
ttacctagag gttttatcac catacgctta atttaaactc atataaaaat agaattagtt 204000 
atatttagtc ttattttcat cattcactta atttaatata attcttagtc cataacttag 204060 
atataacaaa aatttgtatt ttttctatcc tccatttgcc ctaaatctcc ttaaacaaaa 204120 
caaaacccaa ggaaacawtt ttgcctggaa aacgtgaagt ttgattagtt ccacctacct 204180 
taatgcactt taatttgggc cccttctcct cggtagtcta gaatataaac tttttttttt 204240 
ttttttttta agagatctgt ttggaaaacc taccaaaaca agacaaactc aatgatttcc 204300 
ttacagagag aagcctgggg aaagaattag caggaatttc tcaacgggtg ccgctgctac 204360 
tagtttttta tgcatgagtg cttgtcattt ggcagaaatc aattgttctc tttacaccac 204420 
aagagtggtg gaaatggggg taaaactgca agttaagact gctctgaatg tggattcaac 204480 
acttcactgt tgtcctccaa catgcgccct caaagtacca cttggctggg aattaaaaag 204540 
ggttgccttt agcaaagctt taatttaaaa tagtcacagg tgacagctct attatttttg 204600 
cacgytcaga tgcagccaca gctgttttta ttgatctcag ctgcatacac acccatttgt 204 660 
tgtaattatt agtttctgat gtagctgaag atacatatct cgaatgccta tcaaggctaa 204720 
attaaattaa ggaaatcgta cttttcacct tcaccctctt tggtcctgct ttgctgcagg 204780 
aaattgccac aatgtaaaac tttgagcctc atttcatctc cccccactag gagtgatttc 204840 
caaagttgtc ctaggttcct gatctattcc aatttaatag ttactgacat tttttccctg 204900 
tatatttaac acactacaag caattaccat gtgrcaatta tgaatttaaa aattaaaaaa 204960 
aacacaatta ttgctactac tgattaaaca ttgttaggac tttgaagaat aaaaaagtga 205020 
attcaacctg ccctcggctt tctaaaaatt gcctttttgc tcttaaaaat gcatgtacag 205080 
cttttttcat ataaaaataa tttcgagttt cttctatacg cataaaatat tttcagctat 205140 
ttttgatttc aggaagtatg aaaaattaag tatacacaaa agaattgcgg catgacccgt 205200 
tcgaamtctg tctcagtgac agacagtttt ggattgtaga cctcactttg ttttgcgtat 205260 
ttctgtttgt tgtttcacct gtggtccatt cactgtattt gatgtgttgc aaaacaatgg 205320 
cataacggyg ttcccaccag taatgtaaga agaaacatgg aggaagcaag taagggtcta 205380 
ggaaatttgg gagagaggaa aagagacata attaatgagg agagggatca agaagattgg 205440 
caggattttt agaagggaat ttgatttgca aggtacccag aacatggcac acaaaattgt 205500 
aagaaaaaca aatttttctt gaacatttat ataaagattc ctttgggaaa atagaacaaa 205560 
tttggttgta gagtggtctg gggttgattt taaattggaa aaacacagtt ttgttttttt 205620 
ttaatttgaa ataagagctg atatgcaata ttgtagggga gttggttgac tttttagtca 205680 
gaatagtggc ttttacaacc atggtgacaa cattaactgt ggatcctctt gacgtcttca 205740 
taagttcatt aatgctgggg gaaaatatgc atagcaacta atctaccaat atttgagaga 205800 
attaggcaac ctctcaagaa tgtaggaaag aaatgtaagt caacaagctt atacaatgta 205860 
aaaaccatga ataggcaggt aataaataat agaaattgat tttttctgtc cagtaaagac 205920 
catgaaaaga gaaatattat aattttaact tgagtgctct agaatactgg gtttgttttt 205980 
ttaattgatt aatatgattt taaaaaatta tcatgaattt aaaaatattt tctccattac 206040 
tttgagttat cactttaaac atargtaata tcaagatatg tttctgaatt atatatttaa 206100 
ttatgaccac taaagagcac ccatgaaatg gaagtgttaa cagcaatgtt ctgttgtagg 206160 
aagataatca tattggttat gtctgcttta tagtaatgtt attgttgatt gatttgcctt 206220 
ttcctaactt aagtcttttt ggtcttttct tctatttgtt ctaagagacc ctgttcaaag 206280 
ggacatttta ttttcctttg tacaacagta tttgcctgac atttactctc ctctaataaa 206340 
taatattaaa ctctagtcct aatacattta cttacctaag gaaatcggcc ttgtgatgga 206400 
cccccggagg tgagatacat ttctggatga ggatctgttg agaagttgag catttgttca 2064 60 
gctactaagt cactgccacc acctcgttag ctaaacatac tctgtgctca ttccttcatc 206520 
agtaaatttt ggttgggaca catgrcaggt gataatagag gttattagct gcaatgctta 206580 
ttaaaatatt tgaagatata ctttagcaca ttttaataat tattgagact gtcccttagt 206640 
aacagtagct cttctaaatg aaagctttta aaattgcatg taagtcaaaa caaagcatta 206700 
tttgttattc aactttatat aaaagctgtc cagtgatttc aaatatgtac taacatttaa 206760 
tttcaattca aaataagtat ttttatgttt ttataaagtg attaaatcac tttattgaat 206820 
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ScSttcS ^! aaatga ! ^"OTaatg aactcaagtt gtttctattc tattagcctt 206880 
tacttttccc ttcccccaat taaccccaag caaaaatgta ttgactttga cctgagaaga 206940 
gatacgtcaa ttttgttatt tattttgcct ctactcctat ttcccataca tatatgggac 207oJS 
atatatatca gcacaaatta gcaggcccac caatttctta ctcttggatc cagtcJgaJc 207060 
nnttctaact cgtgaaataa actaccatgc agcacggatg .tcttLaa ttLag^ta 207 "S 
iI? t9tC JS attttagtat cccctttagg ttgacattat ttattaaaaa 207180 
ggtagttttc aatcggattt atggggcatt atcgatagta agtctacttg atacattgca 207240 
catacctttc ttctgccaaa aagaagtgtt tacaaaaatg tatttgacca aatactttac 207300 
ggtaaacatc acrgtctgaa agggaactat cttctttgaa aggattaaag cctccagata 207360 
gatacaaata agattttttt aaagtagaca catgttaaat catggaacag caaaggggat 207420 
actttatgga aatgaattca caagataata atttgaattt gacaaggtga taa^aaagt 207480 
ccccagatag atttcttcca catctgtatg cctttcagta ..gttggt?. gcatattaac 20754? 

*? t ??„? 9 * gaatacas ! a "tattcatg ggatgtaaaa ctgcaaatta ttattcattt 207600 
ac^ccaSt IL a t m acaaaat * ca gtgcatacag tgttacacaa gaggtaccaa 207660 
actccattat agacattttt agagcaaacg tgggatcctt atatgacaat ttttgtcttc 207720 
aaatattcaa gacatttaag tatgaattac cactttttgc aatttaaaag aaatlttgtt 207780 

aata^o act" 0 ' 39 , 9 gCtaaCaaat tcttacttca aatgactaga" atctttJtJ 2078 4 S 
aatattatgt actaggatat gtggatatga acatatatat ttatacacaa atatatgtac 207900 

™ : a t a "^ t taa ttagt9taat tatgtat t.at.ttgnt atattttcct 20796 0 
aa^aaata 3 ! gC ^ acca 3 a ^^agaagaaa gcatcaatgt gtgattaaaa 208020 

c a IattcS a till t t tatccactac atttctaatt tgaaatagtg atcctgcata 208080 
ZcTnlt tattcataat tgagtttcat agtacattca ttttaattta gacatttata 208140 
tgagaaagca tgtttaatat gatgataaga aactgtatga aattttggga aagaactgtt 208200 
aagtgtgccc acacactgta ggcactgaat gtcacacttg agaatgtccc tggtcctagc 208260 
ttatctataa caggaatatt ttccrtcata tggaatttaa agataagaaa tgaatcacca 208320 
cgcagagaga tgtgctatca catcccatct atggaaaaaa caaatgaaaa aaaaaaaaac 208380 
atgaagaaag actttttgaa gtgcagcatc acctgcattt atgatatttt cagacat"a 20844? 
aaacaaatat tattctcatg gttttgaata aaattgtcac ttttaaaatt tgacatggga 208500 

aact^t? ItTtlllT 9 gga T ttCa ffcccatagtc aacaggtgat cLttgJJL 208 60 
aactgatatt ctctattagg aaggttagct agagtaggag ataaaaaaca acatgattac 208620 

;2 !!!!" tW ac " acttta atgtgaaatt gtgtagaaat gggggaaaag 208680 
c?aa a t aaa t ttlT a f CCaCttg a ^"aag ttggtgcaaa agttattgca 208740 

ctaattactt ttgcaccaac ctaattgtaa gttatgcttg aagaaactta aaaggagaga 208800 

tllllTtlt Tr^t aaaagCaagt ^aacaagaa gtgaaaatga ccaaagaaJa lolHo 
rlrlJrl 9 acatggccat tttgacaatc aagggagaat agtaatttaa agtacaatgc 208920 
"" a ^ Ctgg a 9^gactgc tcttgcacac aagtgaaagy tctcatcggt cctgtctgat 208980 
gcaaattaca ggtacattgc tgcacaggtg tcagatcgca catagctggg ctggcac?gc 209040 
tltt^tnt 9 gagaggaatg cagcacattc tccacaggcg actggaaagt ctggactctg 209100 
cttc^ta aa ^ aC3C ", ta gtwatcccct tggcaaatta atgggaaaac aacctcttca 209160 
cttctgtgtt ttagaatttc tgttttatgc cttttcattg tttcatgcaa ttttattttt 209220 
tcacgaggtc aggcacagct aactgttgat tttcaccaaa ttaccttgtt attggcttgg 209280 
T.TrllT h Ca r CCCttC Caagaaa 9 tt tcccaagaaa ctatgtactt cttS.ctJJ 209340^ 
Tartr.Tr- r ^ tacctatt ^ctcgtca gccattagag tgttccacta atatagtca! 209400 
agcagaacat ctggcctatt atataaatga atattagttt tgaagaagat atctttggag 209460 
ITtlTrTr "ff? 98 " ttttttcc ta cacaggaact cttggttttt cctcac?gcc 209520 
cattgtcatt ttttctacet tactcagaac atgattttcc ctcaactgca cctttctgta 209580 
actttgacct gtgccatcct tgtgaaataa gtgatatgtg atttatttta rtttctcctt 209640 
ctaagtgaac atgcagagtc cttcagagac catctgtgcc tcctgctgag tctctgtgtt 209700 
Itllttt 99 gCtagCttCC tccagagccc agtctcaggt agatctgaac cacacltgga 209760 
9gaga "" a a 9 aacaa 9 tc ttgtgacact tcctatgcft tgggtccaaa aactattgat 209820 
t^aS^o ccctgcaagt gtgtcagatt ctggcaaaac tagcttggag tttactgagt 209880 
ttaatgatac tgaatgacat caggactcac tcttctcttt tcattttttt gctatgagtg 209940 
alTatttTr »9c»»tgtac tttttttctt aacgcaattt a.tct?tL? 210000 

Jaaacit 3 ^ ITr, , tatatgtaca aagtgtttaa ttttgttact ttttaattct 210060 

tTJTJ "" taaaaa tcttaatcgt tgctattttc acccctgact ttatgcccag 210120 
atot^ 9 . ^ aa r Ca aatctttcta ctcctatctg tttagcaaaa tgtaaataaa 210180 
l 9 ^ a ! " tatttcaa ttttgtgtat tttttttaaa tttacttgcc tgggttgagc 210240 
atcttttagt atatatttca ttgtgtttta ctgaaaattt gacttaagac tSJtgLJt 210300 
! a C c g a C !r C tct f 9tca tgactatctc aggtagctga ggtatcactg acagcataag 210^ 
sagcgacaaa ccctcgggta atgcaaattg gcatatccga caaccccaag ctaacaagat 210420 
cataaccagg gacttgcaaa catgttttta ggacataaac tcrtcttaag gttttactta 210480 
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aagtascatc catctcatct ttaaccagcc 
aatgcagtct ttcttcattc tttcacccgt 
ctctccattg gagtaactgg actgaggatg 
acaaaaatct ttccagttac tactggaaaa 
gttctcacct accattccta gagtctatct 
gccaaacact tttgcagctc tcgtatttgc 
ccaatatttg gatatattga ggactggaag 
aaatactacc tttttccata gaggcagctt 
tggkttagag gaaaaaaamc aaaatgggtt 
ggaaggaagg aaggaaggaa ggaaggaaga 
ttctttttcc ccccaggaac atggcagttt 
atttgttaaa tattgtaaaa tactttcctt 
atttcttcta ggaaaccatt tacccttctc 
ggtaagaaat ctgtcaaagg cacttcacca 
ctgcaggcaa tggggaaatg accattactg 
agtttttgag cagttctacc tgatgaccag 
aacaggatga ctgacacgtc tcctctttgt 
atgtcgactg cacattttcc ttggccagtg 
tcaacagaaa tgttgcagac ctaacatgag 
gattggcctc ctgtgcttct gccatttgtc 
cccaaatgag aagatgcatg cagctgatct 
agcccatccc aacagaccca gagtgagccg 
atggctgttg ttgcaagcct ttgggtttcc 
aaaacttcat aaatacaaag ctttctgcat 
attaaaggca tgtagagtga aagtctgatt 
gctggatgca gcctagcagt tgtgctgtgc 
aaagtcagaa caaaaaaacc cacaaagttt 
atcagttggc tcactgaagt tcaagcagca 
ttttataggg attcttagaa atacctaggg 
gccataatac acaaagggac acaggctatt 
cacccctgaa ggctgaataa gtgtgagaca 
aggacaaatt caaccttcta agcacaatgc 
tgcctcttaa ctcataataa atagcatata 
gagtttttct actgactagg ctgcgccaaa 
ttcatgagat aaattatttt actatcttga 
gagtttagta gttacaccaa aatccagcag 
atcctttgat agtttttctt tatcccttct 
ccttaaaata aggtgggagc cagcatttat 
atacattaag gagaggtgga gaaaaaaaga 
ycatttatgt aatttatcta tgtatataaa 
gaaataatag acaacaggga attgtattct 
tagttgtatg aataacattt ttgagtatat 
tcatttagat tatattatat ccatgtatat 
gctaaattta tacattacta aaaataaaaa 
tgaaagtctt acatgaagaa tattagaaac 
tgtataattt ctacttcagc aaatcttgaa 
taatgctgat catatggaaa ctggatcaac 
ttacatgtag taatattagt ttacttcaat 
tccactcatt aaactatgca tatacatata 
gagagagaga gagagagaga gagagagaga 
agtacagtgg cacaaccttg gctcactgaa 
ctgcctcatc ctccccagta gctcagatta 
tttgtatttt tagtagagac ggggtttcac 
aactcaagtg aaccggcctc ctcggcctcc 
ggtgcccgga ctattcttgt actttgggtg 
gaattataaa gtgattcaca ccctcttatt 
ttagtataaa ttaattctgt atatttttat 
ataaaattag ctttggaaat aaaaagaaca 
tttaaactta caaaagcaat taagaagcag 
tattttgttc cctaatagac tacacaccgt 
gttgccacat taggctgaac tcacatttat 



tctcctaatc tttgcagaat tgtccattct 210540 
ggctccatta aaygtttttc tgcaggtagc 210600 
gactcctttc ttaagaaatc cagaaaagca 210660 
tttctgcagt catcatacat gtatcattag 210720 
atctatttac ctctgaggtt tcttagatta 210780 
attgtagggc cttggctgag actagatctc 210840 
taaagaaasa ctgccacttc acacctatgg 210900 
gaatacatga ttaggaattc actgtgactt 210960 
atttggttat atagagaagg aaggaaggaa 211020 
aaggaaggaa ggaaggaaga gaaagatcct 211080 
gagaaaatcg ggaataaaac awgtcttata 211140 
attgttagta gaattccatc ttggcagact 211200 
cccttctctg ggcactrcat ttctctttct 211260 
ggaatctggc tcttataaaa tgcartaaaa 211320 
aaactccttt ggagasggct tagcttatga 211380 
cttactatgc tcatctttac ttcctcctgt 211440 
catgtgacct tgcagagcct tcaggaattt 211500 
gctctggtct tggccattga cttgcatcag 211560 
cagaggctct aagtaaacgt gtgtgatttg 211620 
ggagcaacat tattcatgcc gccacaggtc 211680 
agacttaagc tatagcctag aatcacatcc 211740 
aaatcttccc agcagtttca agagcaggga 211800 
tggttatgta taaagcagca ttactcacta 211860 
gagtttctat tctgtgccct gcaagctttt 211920 
ttacctgaat tttaggaggt cagggccaag 211980 
catgggaagg gacagcatgt gcccccaaaa 212040 
gagtgtggaa aagtgctaca ggcttctgga 212100 
actatgaaaa agttgtggga catattagca 212160 
agagaaaccc cataaaaggc tagattttct 212220 
ggtatttagg cattaaaggg aagtttgacc 212280 
cctggtttat atttatataa atttttgcac 212340 
aaagtagcat atccaatcta ttaagcacca 212400 
gcaatcctgc cacaggcgct aacatttatt 212460 
atatttttca ggttttaact catttactgm 212520 
ttaaacaact aagtaaagtt ttgtttagaa 212580 
aagaaaatga ctgtaatttc ctatataaca 212640 
ccttacatct gtgactaaca cattatactt 212700 
aaaatatatg catatagaga aaactaacac 212760 
aagatcatat ataatcattt atttttatgt 212820 
tgacttacaa taaaaatgag tgtattccta 212880 
tggggtgaag tctacaatgg atagatggaa 212940 
tagcttatct tcaaaatgtg cttggtcttg 213000 
aaagaataat gatattaata tactcttact 213060 
gtgtcatata tttgagcctg aggacttatt 213120 
atagggatta cgtatcttga ggatgaaatc 213180 
aagcactctg atatgtcatc tattgcctaa 213240 
ttgcaaactg gatcaacttg caaactaata 213300 
tatctttcct aactctcacc ttgaaagtac 213360 
tatatatata tatatatata tatatataga 213420 
gacaggatct agctttgtca cccaggctga 213480 
acatccgcct cctgggttca agcaattctt 213540 
caggcatgtg acaccacacc aggctaattt 213600 
catgttggcc acgcttgtct tgaactatgg 213660 
caaagtgctg agattacagg catgagccat 213720 
tctttttttt tttttttgag tagtcataca 213780 
ctaaatgtgt tcagtctaaa tgyaaaagtt 213840 
ttaattttaa gagtggagag atattagcta 213900 
gatacggtta tagttccttt ccccaaatgt 213960 
tatcacaaga tttaaagttt ttaaggatat 214020 
caatagttaa agttcactaa aaatccagaa 214080 
tatttctgtt attagcttaa cttatattta 214140 
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ttataggagg gatgacttct ttttacaatc atttaatatg acaaataata tattacattt 214200 
tgatttaatg acaactttgt gctttatcta aaaggtagca ttatttttgt gtgtgaatta 214260 
atctatgttc caataaaact tccaacataa gtaaggctct gtatgtatgt tatgttcttt 214320 
ctatatctct ggattatatg gatttacata cttgtatgcc agtacatatg tacatatgtt 214380 
taagtataat atacatgtct gagtataata tatacattct tactgtacaa actatgcata 214440 
tgcatatcca tattgcatgc agctatgtag atagactctg acagcataga aaaattttaa 214500 
acttaagcaa agttgaatag ctctcaagta ctgatatatg tatgtgtcca atattatgat 214560 
tccaccattg ctaatgcaca caaaaaaaac taccaattta tttttcttat ataattatat 214620 
ctctgataaa attattgttt attcagaaac ctaatatcga ccattaatgt tccagtttca 214680 
gaaaacaaaa actcctaaga gaatgaatag aaattttctt aaggaattat aaagaaccat 214740 
ctcagaattt tactctgtta ctgcaaaatc tcatgcaaag ataaccaagt acggtatgtg 214800 
aacgtctctt cttctaaagc tcatgaaact gctctctgat tcctctagcg ttgacaraga 214860 
atctggagtt cagtactcca tttcattcct ggacatttag cccattattc ttgtatccat 214920 
atttcatatt ttcctttcat gggggataat tgtaagttaa aggtgaattt gttgactcaa 214980 
accttataag gaatcataga tcaacctaaa gatttcctaa aaatatacct acatattgaa 215040 
tacttgcttt gtctaagctg cttaacaaca acagaaaaaa acaagcaaac aaaaactcaa 215100 
aaacaaaacc aaaccaaatg aaaacctgat ggcccaaact tgtaaagtgt attttcttgg 215160 
ttaattattt aaataacaaa tatgaggaag aaaggcattc tagtgaatct gctttaaatt 215220 
acataggaga gagtcttatg taaactaatg ataatgaata tgttcggagg gtttgtatgt 215280 
ggagggcttt gtgttaagca ctgtgaatac 'tttgttccat ttaaacgcca ccttagactc 215340 
tacaaarctg agagagatta tgacagttga ccaagattac acaatgagaa agtaagagat 215400 
agaagtggaa cttgatcaca ggaagtcact tctaaagact caacatttga ccctggtctc 215460 
taacgcattg attttgagcc cagcagcatg tgaatttatt cctatcagga taaatgaaat 215520 
gcaagtaact tatttaactt gagtctcaga tacatccaaa taaaaataaa tgccctcatt 215580 
tatagcatca cattctcagt actaagtaag ctatctataa tgatacaatt ccattgaggc 215640 
agaagacttc acttcatatc tacacgacat ttcattcttt ctttctattt gttttttcta 215700 
gaagaattaa ctgaaattca tcccaagatg aggcatcttg gctgaggtga gatatgcaaa 215760 
tcatctgtgt tgtttgacag tcacgggtag ataaatcata tcactaacaa aagtcagttg 215820 
ttaaaacttg aagcacattt aataaaaaga gagattctcc cagtcacaca ggccaggtgg 215880 
gtgcatagaa atggcatttt atcacttact gaccagagag ttgcaaatac aatcccatca 215940 
cctaaaaaca aggaaacaaa acaataataa gaaaactttt ttttttaatt tttaatagaa 216000 
agtcagtttt gcttctgagt aggtattgta aaataaactt atgggttctt ttaacttttt 216060 
ttgctttacc ttactagaca ctgagttatt attttttgaa ggaaagtgaa ttataaatca 216120 
gaaattattt tctctttttg ttaataacct taggacataa gaaatacata tttccagagg 216180 
aatttataga aaaaataaaa gtaatgcttc cattattagt attctcccta aaactatagt 216240 
tagaaattat aaaagtaggg aagggtccat caatagtatt gattgaaaat gtaatgttac 216300 
atgtatcaga tacataaatc cataaatcca taacaatggc aaagggagac atcattacta 216360 
tttggataaa ttaatatact tgaatatttt cccacataaa ttcactaact attcaaaaaa 216420 
cttcacagga cttttgaatt tccgttgttt gttttactga ttatgacagc tcacaaagca 216480 
gtatgtgtaa ctttcaataa taatttagtg atgaaatatt accaaacaag aggagaggat 216540 
tttgattacc tatggtatta gtttatatca ttgtgtaaca aatcttcata aaaatataat 216600 
tcataaaaat tatatttcat aaaaatattc ataaaaatat aattttataa aacatattac 216660 
cacgtgttat cagctgggtc ccagacacag agtccgtcag agtccagtga agatgttggc 216720 
cagactacat tgtcatcaga aggattggct gggaagaatc tgcttccaaa ctcattcagc 216780 
tttggtagaa gttatttcct tgtggttgca tgactggtct ttttgctggt tgtagttgca 216840 
ggtccacatt cgttctagag gctgcctgcg gtccttagag gcctcatagg caaggaactc 216900 
atagtttctt gccatgaaga cttttccaac atggctgctt acttcttcaa gcctgtaagg 216960 
agatccrgtc tcagttggcc aggtcttgtg aaaggtaacc tcatcaccgg agtcaagtgt 217020 
catcaccttt cccatgttcc attggttaga agtcccagct cctgcccaca ttcaagggta 217080 
aggggttata caaatgtgtg accaccaaga agtcaagatt atgtgggtca cttagggtct 217140 
aactgccaca actacatatt tggatataat ttcaaagaaa gactgtgaaa aagcataaag 217200 
aattgtagga attgcctgag aattgaggac gttcatctgc aatgcctgtc aggcttacat 217260 
taaacaagtc tagcttactg agaaagtttt aaaaaatggg tttgtatgac tttacttatt 217320 
gggagaagcc atttgacttg tctttgaggg gaaaaataat ttaaaaatac agatatactc 217380 
tctctgcaaa acaattcctg gaatatgatc taacagaaaa gcccagcaga cctgattcaa 217440 
acctgatgtg aacacagaat gctttgaaat gagtaaattt aattattaaa cctaattttt 217500 
caggtgagga aactgaaaca ctggggttaa gtatcttttg cagggtcaca cagctaaaga 217560 
attttggagc aggaattcaa attcaggcag ctggcttcag acctcatgct attaaacctg 217620 
atattaaact tgacatccat cttacatgca aatgttactc tctctatcag gaaaataacg 217680 
catttataaa agccagctca gatgttactt ccaaccaaca aatttcctgg acacactaag 217740 
taatagagtt aatcacaccg tcctgtaagt actacccaca gtatccgggc tctaccacat 217800 
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tgtaatgcaa taggtgtgtt cacactgttt gtctctcctg gtgaatcgtt aattcatcag 217860 
agcatgactg tgttgtgtgt cccattatcc ttagtaaata atgtagtgcc ttgtgcatag 217920 
gaagaaatta atcttgatgg ctttcaagtt ccaccccaaa cagtcaagcc atgttgattg 217980 
aaaagtatct tctcagtttg agagattacc atcagagtat ttgaaacaag attgtatcct 218040 
tggggactcc atattgagga cattttcttc ataggattct aaggtctaat gacttggcct 218100 
gatcctaacc taatccttta gaaggctaga gaataaaatt aaatatgttt ttcacttttc 218160 
cagtggagta agcagggttt tattgtgctt cttgaatcat tgtcacacta gtaatctctt 218220 
caaaagcgaa tttgttctgg aaaataatgg gttagaggcc tctgtggagg aatacagtag 218280 
tggaatattt tgaaataatt gtaaagcaaa acagataatt gcgtttttaa gctctttgaa 218340 
atatatacgt aagatgttat gtgcctgagg ttgaacaaaa agataaatag ttacagtcat 218400 
tacccaataa tgtaccaatt acaaatagca ttgtgtgtac atgtcctcta ttataaaatc 218460 
agaacgggat gaaaaagatt ataatcacgt taataatttg atttgcttca ttcatctttt 218520 
gagttatctt ttcataaaat tagatactgt gttatttcct tctggaaagt acaatggtgc 218580 
atgtgaggag aatgttttca tgttaatcaa tgctgacaag aatattacac ttaaaaaaca 218640 
cagaaggcag gcaggttggg ggttggagga cagacaaatg aagaagacca gattgaaatt 218700 
attagttgcc agaaatatat ctattgactt tttacatttt tagaagtacc tctgttagct 218760 
tcccaaaata cagctgttta cagtgattga taagttcccg tgcttttaag cttattcaca 218820 
tccattagtt ctgccttttt cccctgttaa aaatctgaag ctgtcacatt gaagtagcat 218880 
gatcagctaa tgttcctgcc agattcaaag catgtcccca ggggtggcaa aagtattcaa 218940 
aatctcaaag aaatggcttt ttaaatgttg cctgttgaca ttggtcactc catttgctgt 219000 
atcagaaatg ttgaaataaa tatcaggcaa ttatagaaac aggactttga tgatcaattt 219060 
tctataaata ttttatgttt agcccaaaca aaagaaggca tttggaattc actattttgt 219120 
ataaaacttt tattttaaaa tagtatgtag tatattttca aaattggtta attaattttc 219180 
tagttttcac ttcaaataat gattatttac tctattccct ctagcattag attattttct 219240 
tgcaaaccgt tttttttttt tctttttttc tttttcctgt aatagtttta gagtttctgc 219300 
cttaacacat ttaggtatat tttgtaggaa caaagggaaa aggtatcagg gagaggaaat 219360 
gatttctaaa atgtaaattc caaagacagg gaatgttaaa agtgtttgcc agtcctgttt 219420 
cttacagagc acagaactca gatgctctta taaagataca ggataaatca catcatttcc 219480 
tgctccatca tcagaatatt attatatgat ttagatcact tttttaaaaa agaacatggm 219540 
cttagtacag aacaacagca aaagcctggg gaaggagagg agtgcaccat gaggagtcaa 219600 
tggggagcag aagccagtcc atttgactga tttggttcgt gtgcaaaata attgctaaat 219660 
aattgcatat atgtgagact ccgggtattt tcaaaaccag ctggcaaaat tgtgttattc 219720 
tctaccctct gctggctttc acgggttctc tgttctctct ccttttcctc cattctcctc 219780 
ttaccctaat tcctgaccac tgtaatccaa taatctaagg ttttaggatt tggatgacta 219840 
aggttaccca tggaattgtt tggaaatgta gacctgtaat ggagagggga gaaaatgaag 219900 
aaatagaagc tgtacgaaaa tagtcaatac tgaggccagg taccgtggct catgcctgta 219960 
atctcagcac tatgggatgc cgaagtgggc agatcactgg aggtcaggag tttgagacca 220020 
gctgggccaa catggtgaaa cctcatcttt agtaaaaaat acaaaaatta tccaggcttg 220080 
gttgtggatg cctgtaatcc cagctactcg ggaggctggg gcaggagaat cgcttgaacc 220140 
tgggaggcag aggttgcagt gagccgagat tgcgccattg cactccagcc tgggcaacaa 220200 
gagtgaaact ctgtctcaaa agaaaaaaaa aaaaagaaag aaagaaaata gtcgatactg 220260 
agacctggaa actgagagcg ttcatgaatg ctgcatccaa agaatatttt gtactacgtc 220320 
ttattcatga tacaaatcaa gcaaataagc acactctgtt gttcacatat gacactatat 220380 
aggtcaacgg ggtgagagag ccacccagaa acagacacac aattttgtta tcagatattc 220440 
tggagaggtt tctctgaggc agtggcatct gaagaatata cagatgttaa ccaaatgaag 220500 
caggtgggtt tggggaaaaa agtattccaa ggagaggggc agggtgtagg acaactaaag 220560 
agcaagggtg attagaatac tttgatgaag tgaaaaaaac actccaatga gtgagcgcgt 220620 
atgaagaaaa gggtggagag cagattattg aggtccattt agatcatgtt ttagatcctg 220680 
gacagcatcc ttaaaataat aaggagccac taaaagtttg aagaagggca atgacatgtt 220740 
ccaatttgct tttttacaag atcactctga cttctgcctg gagtagcata taaagggact 220800 
aagcacgata ccaggtggca cataggagct gaagtrgctc ttgtcaaaat aaggtgatgg 220860 
ggatgggagt ggatttgttt cctaggctgc tacaacagag aactgtgaat ggagtagctt 220920 
aaaataacag aaatctattc tgtcccagtt tgggatggag gtttcagcag gacatgctct 220980 
ctctaaagac tttaggggaa aatctgttca atgtccttct cttagcttcc agtgttgctg 221040 
gcaatccatg gagctccttg cttgttggaa tatcactcca tcatcacggt gttcttcctg 221100 
tatgtcttca ccttctcttc tccgtgtatg tctgcctctg tacctcttct cctcttccaa 221160 
caaggacccc agtcataatg gaatagagcc actctaatga cctcatttta actgattaca 221220 
tctgcaaata tcctatttct aaataaggtc atattcatag gtattgcaga ttaagactgc 221280 
aatatatctt tcttggggat atagtttaac cccaaacagg ggattattga aaggaagtga 221340 
gataagggtt agtggaaaga aaccatgtaa aggttttaaa atatattttt ggtggtagac 221400 
tcaacaggac ttcatgatta agtctgagtt tagggatgga agcagagtcg tggttgactt 221460 
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ccaggtttct tccagatgat gatgccattt gttgagacac aaaatccaac ggatcaagta 221520 
agtggtgggt tgagcatggg ttcatttttt tgacatgcca agtttgatct gtctgtgaaa 221580 
tttttatgtg gagctgtaga tttggtggtt aaaaatgcct ttggaattca aatgagagat 221640 
ttgagttgga gagataagtg tgggagtcat ttctattcaa attatcatca aagctatgca 221700 
agtagataac tttcctcaac cagataatgt ttagtcagaa gagaagaaac cttgacatca 221760 
acattaaaca ttttaaagaa caatgtggac tatttcagac taatatgaac aaaaaggacc 221820 
agactaaccc tgccataaga aaaaacaaac cgtcaagaaa cagacaggac atttgaaacg 221880 
atggttttag agacaaaaaa catcaggcag ccaaggacag aaaaataagc aaatacattg 221940 
atcccaacaa ttgcctgaga ttacctaagt tcaggtaaaa gacaatattg tctgctttca 222000 
atatttctat ttaatttgat attgaaattt gcattcatag tagtaagaca gcaataaata 222060 
aaaggcatcc aaattgtata ggaggaaata aactggtttt atctgtagac aatatcataa 222120 
actatataaa taactatgaa acatacaaaa aggtataaga gctaatgcat gattttagtg 222180 
aggtttcaaa attcaaattt aatatataac attaaaatca cttgatacaa gagcaaagaa 222240 
caatatgtaa ttgaatttta aaaaattgta cttataatag caaaaaatat aaaatatttc 222300 
cagataaatc tgacaaaatt tgtgaaagtt tcacacagtt aaaaatagaa aacactaagg 222360 
aaaaattaat gattacctaa ataaatgcag aaagatacat ttttaatggg ttggaagata 222420 
atattcttaa gatgtcagtt gtccccaaat tcatctatag cttccctgtt atcccagcaa 222480 
gctacattgt ggaaattgac aagctgaatt taaaattcct atgtaaacgc aaaagaattg 222540 
ttacagctgt aacacttttc taaaagaaca aaactgggaa taagtatatt tttaagacag 222600 
ttattagcat taagatagac aaatcaatag gataaaatag aaagtaaaaa aaaaatagac 222660 
ctatacatat atggacaact gattttggcc aaggggcaaa ggaaacttgt tgggagaata 222720 
gcaatatcat attttcaaca aatgatgttg gaacatctgg gtattcatta caaaacggtt 222780 
aatctttgat ccatatttaa aatttaactc aagttgaatt atagatgttg gtgtaaaatc 222840 
tttaactata aaattcatgg aagaaaacat atgagactaa cagtgtcaat tagcgttaag 222900 
taaagatgta ttagatgcaa cagaacaaaa catgatccat gcaagaataa atgtgataaa 222960 
ctggacttca taaaaattaa acaatcgcaa caacaaaaaa taacaaagaa atgaaaaaga 223020 
gcttctgctc tttaaaaatc attattaaga aaattaaaag acaattctcc atggtacaca 223080 
agaaaagatt tacaaaggtt atatctaata taagatttgt atccagaaag tataaagaaa 223140 
cctcaatgtt caataagaga aaaaaaataa aaatgagcaa aagctttagc agaccactga 223200 
aaaaaaatgg gtagagaata tgcatatgar aagatgctca atatcattag tcattaggaa 223260 
aatgcaaatg aaagcataac aagatagcac tgccctcttt tttaaaagtc taaaactaaa 223320 
aatgctgacc atatcaagtg ttggtgagaa tatggagaaa ctagaacact catatcctgt 223380 
ctgtaggaat gcaaaatggt taaattactt tgaaaaacaa taatgaagca gattccttaa 223440 
ataaagggtg atctatccaa ctactattag atctattcaa ttgatctagg tatttaccca 223500 
tggaagtaaa agcatatctt cattataaga cttctacaca aattatcaca tctttactta 223560 
cagcagctga aacctggaaa caactctaat gcccgtcaac agaggaatgg atggataaag 223620 
aaactgtgat gcagtggaat acgactcaac gaagatgaga ctaaaaataa ttatactgag 223680 
taaaagaatc caaacaaaat agagcaaaca ctgtgccatc ctgtttatac cttactccag 223740 
taaatgcaaa ctaatacaca atgaaaaaaa ttacttattt gagaactggg gagaggaagg 223800 
agagggaaag gggtagataa agaaaagagg agagattaaa aggagcataa gaaaacctca 223860 
gagaataata ggtttgtggt aaacattacc gtggtaatgt ttttagggta tattcacatg 223920 
taaaaactta tccaattata cattttaaat atgtacagtt tagtgtgtca gttatgcctc 223980 
tgtaaagttg attttaaaaa aagttcctat ttccaagttc acaatttcat ttgaactctt 224040 
aaaaactggc caggagcagt ggctcatgcc tgtaatccca gcactttgag gagcagaagc 224100 
aggcagatca cgaggttagg agtttgacac cagcctgaac aacatggtga aaccctgtct 224160 
ctactaaaaa tacaaaaagt agccagacgt ggtggtgggc acctgtaatt ccwgttccag 224220 
gaggctgagg caggagaatt gcttgtacct gggaggcgga ggttgcagtg agccgtgatc 224280 
gcaccactgc actccagctt gggtgacaga gtgagactcc atctcaaaga aaaaaaaaaa 224340 
aaaaaactta caccaaggtc aaagcagagg agctggcaaa agagactgag aaaaagagca 224400 
gagtgataga gaataagaaa tgagagctcc cacggaagct aagggcagga accacacagt 2244 60 
ccttcctcat gctacttaga agttaaaaaa aaaaatgaga gttaaaaaag gccattcgtt 224520 
tcagaagtaa cattatggac taatgaaaag gaatttagtt aaataaggca tgagacaaaa 224580 
gaaagtaggt tgactgagaa atgatgaaat agaggtagca agtaatatgg tttggctgtg 224640 
tccctaccca tatctcatct tgaattgtag tttccacaat cccacgtatc atgggaggga 224700 
ccctgggagg taactgaatc atgggggcag gtctttcctg tgctgttctt gtgatagtga 224760 
ataagtctca tgagatctaa tggttttata arggggaatt tttctacaca ggctctctct 224820 
cttgactgct gccatataag tccctttgct cttccttcat cttctgccat aattgtaagg 224880 
ccttctcagc catgtggaaa tgtgagtcaa ttaaacctct ttcctttgtc tcaggtatgt 224940 
ctttattagc agtgtgagaa cagactaata cagcaagtat aaaaaacatt aaatgtttgt 225000 
cttctaaggg aagagaggag agtagagaga gagagattgt tagctgaagg gtaaagtatt 225060 
cttgttatta agatatcaga gcctagaaca tatttaaata ctgatgagaa ggatacaagg 225120 
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gagagagaaa atttgaatca atagtttgtt tccactacca cagaaaagga tgagatccag 225180 
aggtttgata gaggaatgag ttttcctcat tggagagata agagcagatg ggtttgaatg 225240 
ctggatccta agggtgggat attaagggag tctgtttttt atggctttta ttttctcttt 225300 
gatcttggca gcaaggatgt ctgccaagat ttaagggttt ggacaggagg atcatagttt 225360 
taatgaataa aaacatatat gaaatagcca cttcagagag tggggcagac agtatgtgtt 225420 
ttagtttaga ccagagtccc tggagttata ggggtttgaa ttcaaatcct agctcctcca 225480 
ttcacctgtg gtactgtgaa caatgacaga accaagttaa gaccccaaat cttctttaaa 225540 
gtgggatgat aatggcatgt tcaccataga cttcttggga gtaataaatg aaatacttta 225600 
gttcaactat tttcacaggt cctgacactg ggttttgaat aaattgtgcc caggactatt 225660 
gctattataa agcaaaatga aaaaacacaa tggcccaatt aatatgtttc caaggagttc 225720 
aatactaaga agaaaagtat tgttggaaat ggaaatccac attactagat taacatgtgg 225780 
agaacttgct caaagaggag gctggtttag atgctgtgat aattttagat gtcaaatgga 225840 
ctggattatg gaaataccca gagaactggt aaagcatcat ttttggtgtt tctgcgagga 225900 
tgtttccaga agactttggc atctgcgtca gtggactgag tgaggaagat ccctccttac 225960 
cgtgggcagg tgccgtctca tcagctgatg gcccagatgg aataagatag aggaaggaat 226020 
atgtgtttct ctccttctta gggctggaac ttactcttca ttccctgctc ttggacaaca 226080 
gaactccact ttttggactc taggtattac actagcaccc cacttcccat agccgagttc 226140 
ttactcctta gatcttggtc tgagagttac accattggct tccagggcca cactactcca 226200 
tgccaggatc tccagcttgc agatggcttg tctggaggct tttcagtctc tataatcacg 226260 
agccaattct cctaacaaat gctatctatc tatctatcta tctatctatc tatctatcta 226320 
tctatctatg tatctatcca tccatccatc catctatgta tctattatct atatctatcc 226380 
tattggttct atctgtctgg agaaccctaa tacagatgcc agcaaaccaa aaccaagtgg 226440 
caaatgtcta ccattgacat tcctctcctt ccttaatgac tcttagtaac caaaaacaca 226500 
tgtaaaattc catgtaatac acaagccaga aaaggtaaaa tattgtatct gctagttcac 226560 
acagctcatt tcactaccac aatctaccat tattcaaata ctgatttaga aaatgacagc 226620 
ctccaaaaga ctaaaatcat cagtttttac ttagaagaaa aagaaatgaa attttaatga 226680 
cagggaattt taatcttact atttcaattt ataagatgtg actggctgaa tattgttatg 226740 
tactaaaatg ataaatacta tgtgttatta aatattttgt cattgaaaaa tgacctatta 226800 
ttctgaccta ttttattttc aggggtactt ttagataaag gagtttataa aggcttctgc 226860 
taaagaagac atcagagaaa atgtcttcca attgatttat cttgctttgt ttctgagcat 226920 
cttgactcca actgtattgc aatgaaaaat aattttttta aatggcagag actctgattt 226980 
gtcctaggcc ccttcttcta aattccaaga gtctatgtga actctttcta tgagtttctt 227040 
aaaattcacg aaagtagtct ctactggttt aaatggtgtc ccctcaaaat ttatgtcatt 227100 
tccagaaccc cagaatgtgc ccttatttgg aaatagggtc attgtggatg tacttagtta 227160 
agatgaggta atactagagt agagtgggct ctacatccaa tattgctggt gtctttctag 227220 
gaagaggagc agagccacag ggaatggtga ccatgtgaag acagaggtag agattgactg 227280 
atgtgcctgt cagccaagga aggtggcgca ttgccggcca ccatcaggaa gtcggagaga 227340 
agcctggcac agattctctg tccagtggga accagcctgt gtgttaattt cagacttcta 227400 
ccctccagaa ctatgagaca ataaattgcc taaaactgtt ttaagccact cagtttgcaa 227460 
tagtgtatta tgtcagccct aaaaagctaa tacaaagccc tagagaattt caataaaggt 227520 
gtcaggaaat atagatcaca gacaataatg ataggacctt ctagcattct tacacttctg 227580 
taatgtttgt tgctctttcg gtctctccta tatgctttta ttgtgattgg caaaaatccc 227640 
ttgaaaaaac caagaaacaa tacgaacaga aacaaagcta tgtacttgtt ttgggaagca 227700 
acctcaaatg gaaagatggg ggctgtgttc taaaaggtga ggagaaattt atagcgaaga 227760 
taaagagata aagataaatg gtcctgtgca ccactgaatt ctgagaactc tgaacaaaac 227820 
ctaagtatat gttagtgccc agtgaatagt tgttgaatga atggatgaat caacaaataa 227880 
atgaattgat gagataatca atctgtgaga caaaaagcaa cacaaatgag tagacaagag 227940 
tgggttagtt tggctaaaaa ttgaagatca cccatttttt cagccaagta ggcaggcatc 228000 
tgctacttgc caggcatggg tattggtgtt tggcataatg aaatagcaaa acaaagtcct 228060 
actgtcaagg aacatagact cttggaattt agaagaaggg gcatgggaca aatcagagtc 228120 
tccgccatta aaaaaaaatt atttttaatt gcaatacagt tggagtcaag atgctcagaa 228180 
ataaagcaag ataaatcaat tggtttaaaa tatctttcac actacccttc catggaaggt 228240 
gcagagaagg ttccagtcta aatgtaggca actggagact cccttttcct atattccttt 228300 
aaaccctatg ccccaaaatg tgtaatcaag tctattcttg aacagaaaag aacacagtgc 228360 
cctttctctt tggaatttta tataagcccc taagaaaagc tgctattgtc ctagtttgta 228420 
aaagtacaaa tgggtgagta tgcatctgct ctgttcagtt ggaaaattag tacaggtttc 228480 
agcactcctc gataccaatt gactattgaa tgcgtactaa gggaagcttc tgtacagacc 228540 
agtaaaccac acaattgcag aaatcctcta attaaagaaa cacaccataa tcttcctggt 228600 
gaatctgaag aaaaggtttt ttagaaaagt acatctgtat aaggtacatt tgtgtaaaaa 228660 
acaatcattt ttcattgaag tgcagctgta ttgttagatg ttgttctaag aaaactcagc 228720 
ttaaaatttc ttataggctg ctttggctta aaataattgc ctaactggat ggtcacaaat 228780 
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gaccaagaaa ccaaatatat ttcatagtat taactatcat tacagagtgt taaataatac 228840 
tgcacagtag agatcttttg aaaaatattt taaatgatag attctatttt ggtttgaggc 228900 
aaagtcttaa acataacaag ttaaataaat ttttaatgaa attttacaag tgaaataatt 228960 
aatgttcaag acatagaatc atgccttttg gaaatgactc tcatgatatt ctattatttt 229020 
aaaaactatg tgcctatttt taggaattaa attgctctgg tagatacccg ataaatatct 229080 
atctcacaaa tcatgttttt ctgtcatatg taaatttttt aaatgtttta gtttataggt 229140 
tgtttggctc tgtcacccag gctggacagc agtgctgtga tctcggctca ctgaatactc 229200 
tcctttctgg gttcaaggga tcctccctcc tcagcttccc aagaagctgg gactacaggc 229260 
atgcaccaac atatctggct aatttttaaa ttctttgtag agatggcatc taggtttccc 229320 
tggttggtct tgaactcctg ggctcaagca gttcccctac tttggccttc caaagtatgg 229380 
gattacaagc atgagccact atgcctggtt agtttctaat ttttacattc agtttatttt 229440 
tatgtgactg gcatttgtgt aggattattt ttataaccaa tatgttttag gtatgtaacc 229500 
cagaagtaaa tcactaatta atctgcaccg gaggaagtat aatttaggga gagtagaaac 229560 
tctatctgat agactgcact gcaggtaagg tgtcacagcc ttgaagcgcc tcagtgactt 229620 
tgggaaaaac aagcaccaag actgaccttc cagatccaag gtctggtagc aggatggttt 229680 
tcatttccat ctgttggagg tctcagcttc tgacataaag gcctgcttta tacaaaatga 229740 
gcctgattga aagatgtaag ccaccaggtt agagggagaa ctacaggagg cagacaggag 229800 
aaagtaagag aagagtgacg ttgttcttcc actagcttgt ggctactaac aaatttcatc 229860 
agaaagttgc tgaggattcc tcagaacaat ttacagagat aactgtaagc tgcagagggc 229920 
tatgaaaacg ctcatgggtg ctgacaggag cccctggtgg gtggtgggtt gggaatacag 229980 
gggctctgta gggggtgaag gaactgtgct tgacaccaat caatgggagc ctggagaaac 230040 
tgaggcaggg actggaatcg tgctgagact caaacgtttt gagttagagt actcaccagg 230100 
tactctaact caatcatatg ccttgggtat aaggaaatta ggaataattg gaactgtggc 230160 
tggccccaaa tttcatgccc tatttatagc cattcatcaa atttagaatg aggtaaagga 230220 
aggattcttt ttattatttt ttaaaatacc cacctcattt tttacaccca catcataaat 230280 
caccgagagg gagaattagg cctgtggacc tccctttttg tagtacctca tatgtaattt 230340 
tttttttttt ttttacagat tcatgttttt tggtacaagt gaacgtttta taacraaaag 230400 
gcttttttgt tgttgttttt attctctata aaaatacctc tgggtgaaaa ttgttaagaa 230460 
agctaattat tgtatcttta catgactcta aatttggtct atttaaaaag tcaaaacata 230520 
gacatttgga aatgactttc tcctttcagt actgaagttg agagggagaa ggaaaaaaag 230580 
tgagaaattc cagtgcctgg atgttcccag gctaatttct agacaagtga gttttgataa 230640 
gttcaagttt cagatgagaa tttctttttt cttctctcat tggttttccc acatcgtgag 230700 
ttaaaatgtc tctgaaagag tctgaaaaaa tatatatatt aatgtggaga attatttaca 230760 
taaaatttta ataaactaac atgagaatga gaccctccaa taaatattgt ttagagttat 230820 
tttaatagtc ctctccccta tcataaccca tttaaaaatg tgggtatata tatatgtgtg 230880 
tgcattcata aaaatttagt gtgcagttct tagtttttct ttatgcaaaa cacacaaaat 230940 
aaatgtgaat aattgaagcc attacacaga catttttcct gctaagttct gtaaatagat 231000 
attccaataa agacactacc aacatcttac tttgtcttaa atataattga gagtatttcc 231060 
atattaaacg ttttaaataa aaatgattgg atttcaaatt gccttaaaga ttcataatgt 231120 
aatattcttc accctaaatc acccagatta tttctaattg tttctttatt ttgccttcat 231180 
aaatacagcc tttgattaat ttagcggaat ttatcaatct gagatatttt taccttccca 231240 
•cattacagca tgccatggaa tgaaaactta ctttgttgct caaatgtatc aaacacagtt 231300 
tctgtggtca agttcctctc cttttctaaa tttgcttaga ggatctcata aaacgtaact 231360 
cctctgacaa gggaaccatt ttagcaccaa cactgcaaaa gcttctgtgt tcctaaggga 231420 
aagatccttt cctgaattaa atttaacctc tttagtactc ccatttagcc acctgataaa 231480 
tccacttgag ctatcttttg ggaagagaga ggtatctggg aacaataaca cttccttttt 231540 
gaacagttta ataaagcttt gtgagatttc aagatgaaag ataatgtgta atgctgatag 231600 
tgccctccaa ggctctgcat tcatggatcc aattacgttt tttgtcatgg taaaagccac 231660 
agtggatata ttaaatraga gtgtggttta agaatgaagg cccaggagtc tggagatctg 231720 
gtttctaagg ctgacttcac ttctgctttc ttatctcact aggggtatgc agcttaacat 231780 
ctttacctca gtgtctcctt ccataaagtg cccagcaaat atctgagaaa ttcttttaaa 231840 
atgacagtgc aaattgccat ttagaatcac cttggaatgt ctagaaaaaa atgaggtgaa 231900 
tgttcgttca gaacgggaaa tacaatcttc tgcttaattt cagatttatc atcaagtttg 231960 
tcttttactt tttaaaagtc atgagatttc cagaagagga atgacatttt caaagcctta 232020 
aggacacttg tgaaaacact ccttcaacta ttctttattt ctcaccaaaa ttttgactct 232080 
ggcttttctc caacttcctc tcttccccaa ttcccaattc caacttgaaa agagtaattg 232140 
caatcttaac tttttccatt ctggagtatt ttcttgcagc cgatttgttt catataattg 232200 
taaatttatg catgtcagat tttgttttag atgcacgttt gctggcttta gtcagtcaat 232260 
ccccaaaccc ttaaactgat taatttcacc gtgaaataat tgattacgaa tttgtagcat 232320 
tatctgactt tattttaaaa tgcaagttga tagaacattg tcatttattg attaatctat 232380 
tgccctttgc cttgagataa ttttgaagaa gcagaatagt gccattagta agacaatatc 232440 
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tatgtattac aaatgatcat gttcttaaaa tacacacgca cacacacacg cgcatgcaca 232500 
cactcacact ttctagtaat aagcagtgtt ttgagtaaaa tcttttgaaa gatgtttagc 232560 
aaatcgaaac aatttctgtc ttcctatgtg atgcctgcaa ttatatggtt tagatattta 232620 
ttgaaggact tgggaaacct catagtccca ttggcattat tctcaacata tcaaagattc 232680 
attcagtata aataaataat ttcattactt ataataataa aagttatcat ggaaaatttt 232740 
ataacctagt aaataggatg tcatagaaat attcaacacg gataaataaa atatatacat 232800 
agtttaaatt agaacacctg ttatagaatg gtacagtcta actctctggc caggtaagct 232860 
tctagcatga aacttcacac aatcatacct acaatatgtt cttgcctaat tccttgttcc 232920 
aaaggaaagt ttgataagga agttgaataa tttagcttcc ttttagagat ggtagagctt 232980 
atttaagtag ctttcctgta gtacaaaaag attaggcagc tctgattgga actctagatt 233040 
gaaaatgagt agaaataacc tcttatagaa cacgtctaaa tttaactttg cttcacaccg 233100 
ataaactata gcctctacta ctttatagac agattatagc tcatcagtag gactaaaaat 233160 
cactgtggaa ttactctgca atagtcctgt tgctatgtct ataaatataa tttcagtgag 233220 
tgacttttcc attttctagc tgcttggcta tacttttttt gcgtggcaaa ctaaagggct 233280 
ttaggaatat ggtaaatgac aaatttaaat agcctcagat aaaacatgaa agtcaacatt 233340 
aaattatttg gttaaacatt attttcagaa caattttaaa aaataatatt ttgcaactga 233400 
ttttaactta atgtttatac tattctaaaa aggtaaaaaa aatgcaatgt tgtttattct 233460 
cgttaagcct aacatgaaaa taaaagcatt atttagggga aaaagtccca tttaattatg 233520 
tgaacacata attaaataat tatcataaga ggcaaaaaaa gatagatagc atcagatatt 233580 
gtagcttggt agagcttcag tactaggggg tcaaacagtt gaaaataatc tgggaaacag 233640 
ttttaagtgt tttatattta aaacacctac acacacaaac acacacacac acacacacac 233700 
actctcacgt aggcagtgtg aaatttacat tgttcctttc ccatgttatt gttgtgtatg 233760 
ggcaggctta tttttcacat atttccctct caatagcctt ggggttctaa agagagacac 233820 
aacattatca cccaagatgc ttgacaaaga tgggtatttt gagggtgatc cagatttact 233880 
gatgatatga ggcctggaaa tctgaaaatt taacaaactt tcctagtaat ttttataaaa 233940 
aacaaaattc aagaaaaact actgcatgaa aaaatgtgaa gaagccagtg gatactttta 234000 
aaaaagtgtt aaaatggttc ctatgtaata gacaaataat ctctgagcta gataaaagtt 234060 
caacatgaag tataatctaa tattaaaaag cccagagaag ttgctttcgt aagttccaac 234120 
attagcattt agctttctcc tggttaaaag tttaaaatga agaagataca tactctgcat 234180 
agggctgagg aaggtaagag acccaaggac ataatgatct ctgtaaatgc ctctgtgcca 234240 
tttccaaata gccgtcctca tgccttctct tccatccttc tttccttgtt accgtctctc 234300 
ttccttcttc tgtctccttt gctgcaacag tatttaagtg gatatataat acatattaac 234360 
tctatcacat cttttaacat acatgttggt ttaatgactg tggagcgatt gtaggaattg 234420 
tccattttac ttaatttttc agaggttttt ggttaagaca aaattccctc agagactcaa 234480 
atgtcatctg ttgattgaaa tatacatctg tgttttgaga ttaagcccaa acgaagggtg 234540 
gcttctctct ctggatcgga ctatactttg agtcatttag ttacagtaaa acaaaaatat 234600 
caaaatggca cgaaaaaatt aactgtggtg cctaattctt tgttcctcat gaaatggatt 234660 
gcctatacca tacttaaata acatgtgcat agcctttcca aagctccaaa cctcacgatg 234720 
aaatagtcct ctcgtccctt ccctgtgtga gtgttgggaa gattcctttt ccctgtcaag 234780 
gggcctttcc tccggtgttg accacaaggt agttttttga ggggatctta agttgtatta 234840 
gtattttatg agttcttaga tagggttaaa taattcctct tactttggcg tggaaagaga 234900 
atggagtgtg atagtgaatg tgggaaaaag ttgtgaaaga ctgtgaaatg agctgtaaag 234960 
attactaaat cacctgacta gcagagtctt ccaagatttc tctgcctttc atcatctttg 235020 
agctattttg caactttgta aaattttaaa aatatgatat aaatgcaatt gactttcttc 235080 
cagaaaaatg caaagagagt gtccagtgaa aatgctaagg attattgctc tgtgaatcga 235140 
ccagttttac ctgtagatga ttttaaattt gcctttgaat tggttgtaac atcttattgt 235200 
ttctcttatt aatcaagtaa aatctttgat tcatgttcat gatacatctg tatgagctta 235260 
ccatgtcttt aaatgcctta tcatgttaac ataacctact ttctagacaa atagsactcc 235320 
ctttcttatt ttcaagtaag tagcaaacac ttatttgaaa ttctgagccc atcaatttat 235380 
tttttatatg aaaggatgaa catttcctag aagccctctt ggtatcatct aggtaaactt 235440 
catatcaaat ctgtgctaat atgttatttg tcctggaact ggttcttagt ccaaatttta 235500 
taccatttat tcgactctta tatccttcac cttgatcaat ataaagtgca tttgccacac 235560 
attcggtttg accaatatta ttctctgtat agtggcttta actcaattat caaactgtat 235620 
tttaaaaatt gaatggccag gtgtggtggc tcacatcttc tataatccca gcactttggg 235680 
aggccaaggg gggcggatca cctgaggtca ggagttggag accagcctgg ccaacatggt 235740 
gaaaccctgt ctctactaag aatacaaaaa aattggccgg gcacggtggc tcatgcctgt 235800 
aatcccagca ttttgggagg ccgaggcggg cggatcacaa ggtcaggaga tcaagaccat 235860 
catggctaac acgatgaaac cccatctgta ctaaaaacaa aaaattagcc aggcttggtg 235920 
gtgggcgcct gtagtcccag ctactcggga ggttgaggca ggagaatggc atgaacccgg 235980 
gaggcggagc ttgcagtgag ctgagatcgt gtcactgcac tcgagcctgg gccacagagc 236040 
gtgccatcaa aaaagaaaaa agaaaaaaga aaagaaaaca aattagctag ttgtggtggt 236100 
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gtgtgcctgt agtctcagct gcttgggagg ctgaggcagg agaattgctt gaacccagga 236160 
cgcagaggtt gcagtgagcc cagatggcac cactgcactc cagcctgggc aacggagcaa 236220 
gactctgtct caaaataatt taaaataaac aaataaataa aaatagaata ttttggttgg 236280 
gtgtacatgg gtgagctgct ttattttatg tatgtagaat ttctaaaata attgtgcttt 236340 
taccgtctag cttctataat ttataaatat gtaataaata gtagataatg ttaatgagtg 236400 
tgttgtttgg cattttgtta gttggcttaa aaatgagaat gataaaaaag gcatggtatc 236460 
agggcatttt aaaattatta gccatggggg taaaaaagaa aaaggaaagt gagacaagtt 236520 
gccagcatgg ctacacagag aaattccagt ttccccaaaa ctggaaaaga atagcatttt 236580 
atatgaggac agtgtgctta atacataaca aagatagctg ctatatatta gaaattatgt 236640 
ttgttattcc ttaagcagtt atttttaatc ttgatagttt attctggtta gatatataac 236700 
aagaatataa ctctagtctc ttcttgcaaa aaaaaaaaaa aattccaatg agcaagatga 236760 
ttaatccagt aaagaaacat gtccagcctg gggacaaaaa aaaagtagat caaattttgg 236820 
ttaggaagat taaccaagta aaaatccatt agctggaatt tattgtaaaa gccaataaga 236880 
aatcatccaa gaaataaaaa actaaaaata gccctcaata tggaataaaa aaagacatag 236940 
agtgattctc tttgcattca atagagactg agaaaacaca actgtcagaa tattaaatat 237000 
tgctttggaa agtttggaaa tatgatgaaa gtaaattaga caattgctag aaaaatatcc 237060 
ttcagttggt tccccacaca tcacaaaatt cagcttgata ccattttctt tttcttgaca 237120 
aatttagggc ttttattagt gctatgacag aaagaaggaa tgctaccaaa aagaccattt 237180 
gaatttactt taattatttc agagtaggga aacgataatt ggaaagtagt ttataacctt 237240 
gagtttgtaa aatctgtaaa ctctcacaac gatgtgaagg cagagaaagt atcaaactca 237300 
gtcccctata ataagctcat tacttcataa aatatatatc cccttcatct atgtgtcaga 237360 
aatgtttaaa attttcacca ctggtacatt atttttgggc cttacatttt tgtggatttg 237420 
gtgaatacct gaaagaaata aaataggatt ggataatgtg aatttctttt acactcattg 237480 
tcattattag aaaataaagc ccaaataaaa tatgacagag cagaattagt tttaatacag 237540 
gaaaaatttg caagttgaag aggagaaaat gcagccagga actaacggta cacacctagg 237600 
tggttaagac agtgtggatg gtgataagag aacataagga aaattctgcc ctccagctac 237660 
tttgaattta gtgttctgcc tagtatgaat attattgaac cctatggtat ggtacttaac 237720 
tatctaatga gatgataact ctctcctacc tgtagaaaat gtagatgcct tgtctacttc 237780 
tgcttatata catgctgaaa catactgatg tctgctgtct tcttaatttt acactaattt 237840 
ttaaagtaaa gattgatttg ataaaaaata tgcacttgga aaaattattc tgctattccc 237900 
acatttcatg ttcatgtcta tttacagcta acaagactag cacgcaaata gatatatgct 237960 
ccaagatctc gcttcttttc atgtctggct tctccctgtt tacccacaat tattttcata 238020 
atattttact gtgattccac aaaccagaga caaatctctg aagggaatag tgttccaagg 238080 
catggccatc attagaagac aataagcttg ttttccttcc aacattctta gctttgttga 238140 
aggctaagtg gaagaaatta acatactgca tgcctttcaa tgaaaattgt ttctaattat 238200 
tttgcttctc aaattattcc accaggcatt gattagactt tttaaattca tttcatatct 238260 
gtcactctca gctaacattt gtaatttttt aattgttgtt aaagagaatc tattaattga 238320 
aatataatta tcctctttaa taaaagtaaa taaaattact tgggtagcca acatattcaa 238380 
gtaatcatca atcttggtca gaaattaaaa gtaagatgag catctaaact tggcactcaa 238440 
agactgacca tctgcacctt tgaggcaaaa taaaattcag tttaggaaaa ccttatgaaa 238500 
atcaatgttt taattatgct atcaagggct gtggtaggca gaaacttctt tgctccaccc 238560 
cccaaaaatg tccagttctt aatcctgaaa acctgtggaa tcttgtgtaa caaagggact 238620 
tcatagatgt aattaatgtt aacgaccggg aggtggagag tgacctcggt tatcctggtg 238680 
ggtccaaggt agtcacaggc atctttataa tcagtgaagg tctccctgct gtgcttagag 238740 
agggggatgc tggaagaaga gtggtcagag aagcaacatt gctgactcya aagacagagg 238800 
gcaggccacg agccaaggca tgcaggaggc ctccagaagc tggggcaggc aagagaatgg 238860 
actcagccct ggagcctcca gaaggcatgc agccctgccc attccttgct tttagccagg 238920 
taagatgtct gtcagcctga tgagaactat gaaataatga atgtgcttgt ttttttcacc 238980 
actaaatttg tgcttatttg ttatagcagt gatagaaaac taagacaagg ctatatcttg 239040 
gcatgtcaga ttttaagtta tctgctatag tttagaaaag cccanagaag gagctgcaat 239100 
ttgtctcttt gaaaaaaaaa aaaaaaaaaa ggatcacctt ctaggggatt aaacattaat 239160 
gggctgggtg ttgtggctga cacctgtaat cctagcactt ggggaggccg aggcgctgct 239220 
gcactccagc ctgggcaaca tagtgagacc ctgtctccag aaaaaaaaaa aaaaaagaaa 239280 
aagaattaaa cgttgacaga ttattaagtg aggcactgtg tgagacctag cttgggtttg 239340 
cgcctataga tatttaaaag ttgatccagg ttttcacttg aaagcagaac tactgcctta 239400 
aattcccact tcacaaaaat aaatcatgga ttgagggaaa aatttacttg tttgctttat 239460 
aaaaagagat atcgatctct tcataagtta tatatgtttt acaaggttgc actgtttgaa 239520 
ctgtaagtgt gaatggttgc taacactcca gaaaccaaga ggtacatttt aactggaaaa 239580 
attttcttcc tgtatcttac aaattaattg aagtttgtct acgcttgaaa atatgtcaaa 239640 
agtattatgt agctttacca cagttttatc taacttctag aagaatttaa attcatataa 239700 
agataaagag taacatacca atagagttta gagagttttc agatttgctc agaagaatgc 239760 



100 



WO 01/40493 PCT7IB00/01507 

tcytttgaaa acctatgaag tagattttac ccaatgtata tctggatcta aaaattaaag 239820 
aaaatatcag tcgtgttcag attcctttat accctgcaga agagagaaag aataagcaac 239880 
atcaatggag catataacat gtatggttta gcactttcat atttttaatc ttaactattc 239940 
agccaaatac gtcttatttt tttattttta tagatgaaga tattgaggtt tgcatttttt 240000 
ttttataaat tgctcagtca tacagttggt aaatagctga caggagagtt cactccaggt 240060 
cttactggct ctcaaacctt tgctttttaa aaatctccaa actagcgcct ctccctttta 240120 
actgcaaaat aagcacaaag ttatcccagt tctacacagt gctaaacaaa ttcctgtgag 240180 
caaacaatta gcacaggcag atggagtgag ggggaggtgg tttataacaa tggaaagctg 240240 
tctgcctgct gtggattaaa gattcaaact ctttgatcct cctcccaggg ataggtgagg 240300 
gcctatgtct cctcctcatt aacatgagtg atccttggct aggttctcct ttgactgaca 240360 
gaatatggga gaaagtaagg ctgaaccata tactgggctc aggcttccaa ccctgacatg 240420 
ttccacttcc tgtctccagg aacattcacc ctgggagctc tgtgctgcca tgcaaatgtc 240480 
acattgcact gaggccgcca tgctgtaaaa aaccccaagc tacctacctg aaaagctgga 240540 
gagaatcagc acccatcagc ttttccagca ttttgggtcc tcccagctaa atcatcggca 240600 
ttgtgaagca gagatgagac ctcctgccat gccctttcca aatcctggac caacagaatc 240660 
atcatataca ctagtaagct tttactttaa gccactaatt tggggtgtag ctcttttgca 240720 
gcaaaagatc atcagaacac tgccatgcag gagctgataa acttgctcac tgttatgtat 240780 
gtatccaatg actactcaca cacacattac ttgcttcact ctactttctg gctttccatt 240840 
cagcaaattt taagaaatga aaatgttatt cgtgggaaat taactcaaga tcaatattgt 240900 
tgtagttaat ctctttgtaa atggtaaaat attaaattat tgtgagtcat ttttatttaa 240960 
agaagtttgg ttctttaaat gaaagaagtt tgggtctttt gtggatactc tgcatataga 241020 
cattctagac atgaaagaat gttgaagaca aagaatgttg aagaaatata gacatgaaag 241080 
actgttgaag tttggttttt tgtggatact ctgcatatag acattctaga ctggagtgat 241140 
acatcttgtt caggtttcta tgtttttaaa atgataactt gacacgccta ccaatgccaa 241200 
tttttaggat atatatctgt atatctgtgt gaactaagga aaattcagga atgattaaac 241260 
ctacatattt tgccaggaga gactggataa tgtaagtctt tatatttttc acaacttcaa 241320 
gtttatagca tccaaatgag caattgctga catacctagg ggatttctct gcaggttcaa 241380 
gtctgatctg gcctgagggc aacactctta ctgagtgtgt tcctaggcac tccttggcag 241440 
ttggctccag gccacaataa agagccctga tgattctctg agcaggccca acttccattc 241500 
acattcatgg gaatatgagg atgacttgtg atcttcagga cttagataga ctctttcagt 241560 
tggtgtaggg taagctcagc tcatctctag agacttccca cagtggagct catgcatgac 241620 
tctgtgcagg tggcacagga cagccagaga caggcaccta aggagtgcac taccagcagg 241680 
tggactctcc actgagggag gtagacaagg gagaaagcca ggtgtccaca tggagcaggc 241740 
cttactcagg gagactgaga ttttgctcta cacatcataa aacgtggagg tcattaatgg 241800 
aatttttcac aggacatgac tgccatagcc atgatagggt ccctggctaa tctttcaatt 241860 
rtgatagcat cataagataa ttttgaatat gaggagagaa gaataactct agaactgcaa 241920 
tatccaatat ggtaacccct agcaacctaa ggctacttaa actaattaaa attaaaactt 241980 
tagttcctca gtgccactag ccatatttca agtactcaat aaccacacac acttggctac 242040 
catgttggac agcactgaca tagaccattt ccattactgc agaaagtcct cctatttgac 242100 
agtccacctc tggaaccaat ccggagtctt tagggaccta aaagactgag gtgctaaatc 242160 
tcagagcagt actctgctta atgcttagtt aaaagacttg ctaatctata tagttgatct 242220 
agagaacttt aaaggcagct agttcgtagc aactatgttg ctggaccttg aaactgtatg 242280 
tcaggaactt ggtgttctag tggtgacttc tcttttatct accatgttca cacttttcct 242340 
accactcatt tatccattca gaaagtattt cagaagtccc tatgatttac caggtgctat 242400 
ttcagacact gaggcactgt ggggaacaaa acaaacaaag tctcctttct cagggatctc 242460 
acattccatg tggtgatgca gactgtaaaa attcacacac agaagatact ttcatacagt 242520 
aggaagtttt atgaagaata tttcaaaagg ggtgtggggt gggagaggtt gatgttcagg 242580 
tgtttgcttg agaatgaact gttcatacaa agaatcaata attcatttgt tccacaaata 242640 
tttattgagt acctgttatt agcaggctac tttattaggt tgccagagca gggttttctt 242700 
agcacaatta aaacacaata acatttgtta gaaattcaaa gacccatgtc cactcttgca 242760 
taatgaatca gagtctcgag gttatggtcc agagactggt gtatttaaaa atcactgaag 242820 
taattttaag gtatagccag aatagagagg acaccggcat tcttcctgaa actcaacaca 242880 
ctagacaatg agaaatcatt aaaatgaagt aaatattttg attgagattg ttttggcttc 242940 
aaaattaata atagcttgga gaggaaagaa ggtaaattat aaaagacaat taagaatctc 243000 
ttaaatgtat ttagataata gttacagagg gacaaaaatg agtaggatct gtgataacag 243060 
aaagaataaa aataaacaaa aggacttaaa aaagtaagaa taagaaagaa gtcaacagta 243120 
acccagaaat tttcaacctc atgtctaaga gaatgatgat gctatttagc aatatatcca 243180 
aaccaaaaat actacgagaa gtaggttgtg ttcacttgtg atgttgtaaa tttataatta 243240 
ctttttcttt attttggatg catgcttttc ctttttctga agttctgaga catacaaaaa 243300 
caataaagaa aaacatgatt ttatgccaga aaataagata ggaacaattc ctcatgctca 243360 
atataatttt agcctcccac ataatccaaa atatatgtgc agcaaattgt gggagtttcg 243420 
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aaattagatc caacttctta ttacagttaa agataaaaag aaaacaaggc tttgtctttt 243480 
ttaggatctt ttttggaact tgcagtataa aaatactctt tcaaaaaaca taactcaaca 243540 
tcaaagtcca ataaaacagt aaactggact aaatatattc gtatattagc ttgacctaag 243600 
tctgggaatt gagccagact tttcaaatta tactatcatt atttacagaa aaaggaaatg 243660 
ttaactaaca ttattaaata aaagcacata tacacatgcr taacaggaaa caaaatgaat 243720 
gttaatgtta acaaatgttg aagatgtgtg tgtgtgtgtg tgtgtatgtg tgtgtataca 243780 
aagcagataa gatattctct tgagcatgat tggattggaa atagtagcta tgccagggga 243840 
aattttatgc aatccccttg gctgtgggct acacatttat tgtgttatac aaatatgcca 243900 
aaaaaaatgc catttacaca gttgcctaaa acaaaattga aattatacat tagtgtttaa 243960 
agtttttttt tttctttcgc ctccttttct taatgaatgg cagataatga aatttaggag 244020 
tttgcaaaaa caaacgtagt ttaaattgta ccttagttgt ctatgaaata caaatacatt 244080 
ttatatattt atggaaacat ttgtgtgtgt tcagaaagtc catattttgc attcaagatt 244140 
gaccctgttt taagtcatat ttcatttgct acactcaaat tgactaaagg cagatacttt 244200 
ggcattcact ttttgattca ttattaaggc attattttga ttgattctgc ttaaggtagt 244260 
tttcctgatc ag'agtagacg tttaggccac ctatctagac aaatatccca ctttcctatc 244320 
tgaaatattt gatctctcct tacccattct agggttttgt gcatctctaa gatgatcatc 244380 
ttggtgtggg gtcctgatct tcccagtgac ttagcagcac agtaatttgg ctgaaatgtt 244440 
gatgagttca gacaagaaga aggtgatgat gacaagagag cgtcacgggg aaattctgcc 244500 
ctgaggaagg aatagtttga ggctggagag atgtgctacc cccgccactc tttctcaaat 244560 
ggtatttatt cttgaaagac agtgtaaagt gtctgaaatt gcagaagcaa agtggttgct 244620 
gtagaaatac ataccatgac gtgcttcaac tcgtttcctt aaaggttaaa gaaagaaaaa 244680 
tgtgcatctt gaagaaaatt ccaaaccctc attgtgagga gtaatccaat atttcaattt 244740 
agaaattcag agggttttaa ataagaagca attattctag gcaaaaagca aatataaatg 244800 
ccagcagtca aaaaaccagt gaaaagatat gattaaaact gagatgtaaa aaatagatat 244860 
ttatctgaag ttcttgactt ttaaagtcca caggcatttt cccttaagaa aagaataaaa 244920 
acattaaatg ttaaagaaaa acataagaaa tattataaaa aagataatta ttcagaaaat 244980 
attagtattt tgttaccaac accaaagaca aacagagtat ttttcaatga ctggaagcca 245040 
gtttaagtga ggaaaaaaaa aaacacgtta ttattactgc tattatccat tatttttatt 245100 
tttacttttg gcagtgttat ttgctaatca gctaacgcat ggtagtggac ttgatctttt 245160 
atgtattata aaggttgtat cttctcattt taagggaata ttttaactac attgtccaat 245220 
tgtacactat aaatgcagca agaccaggtc aagtgaaact tcagtgggcg agtctctttc 245280 
ttttcttgtg tgtagggctc tccctcccac ctgctacctt gactttttag attccctgtt 245340 
cccctggtct gtgcttgtat aatatacaag ctgtccttga catagcacgt tcctcttttt 245400 
tccttcatta tttttaaatg gttataatat actcttaaaa acaaatttga atattactaa 245460 
tagattctaa tattatctta ctctctgaaa ttaagtagtg atgtatattg tgttaaacgc 245520 
ttgtctatgc tgatgcaaca tatgggcagg tctctctctg ggcctatatg tatttgtttc 245580 
atatcgttgt acacagattc attcatttct ttaaatgggt atataatatt tcagtgtata 245640 
aattaatcac tcgtaagtgg ctmgtaattt aatttgttgc tacttcctcc caagatgaac 245700 
agtgctttag tgagcatcta gccatttctg tatctgtcta cttattgatt tatgcatata 245760 
ttcaccacct tcttcttcca caaatagact acacatcctc ttcatgagag tctacacaaa 245820 
aatgttacaa ataacatttt taacgcaata gttcccttaa gctcacactt gtaatgtttc 245880 
tgcccagact taagccagcc aaatgtgtgg aaataaattc catttgtgta ttaaatatca 245940 
aagataggga acattcataa caaatrtaaa satagccaag gattcattga aacatttcca 246000 
cataccatct ctccaaatct ggttttaaag agattgtaac ataatttcag gaataaagac 246060 
cttctggaat tgagatttat ggaaaagaga ttatagctcc acaagattct agggatgcaa 24 6120 
aaggtctgga ggagcctcca ggcacaggca ctttgtttcc atgctgaaga ttcaatgtgg 246180 
gttctgaaaa tgctgcttta gcaagcacac atttgttcca ctggtgttgg attttctatt 246240 
gattaatagt taccattttt ttcttctctt aggaaagaca acctgtaggc acagaaatca 24 6300 
ttgctctgct tacactgaaa tctcctgctg actttgagag ctgagcattg gattttgatg 246360 
ggagagtgaa acttagtgaa gaaatgattt gaacctttgt tgatctaata cttctgtgat 246420 
acagttatct aaacagaatg agtctaaagt cagtataaag acctgtgtct ctccatgttt 246480 
tgctgcttat aattgcagag gaacattttg agatagaaaa ttttatccct ctttcaagtg 246540 
actttttata aaacataacc ttttaagaga ggtcattctt ttgaaaaatt gtctgaatac 24 6600 
aaaaaagtaa ctgaacagca agggattgta gccataatgt tcttgaccac tttagattga 246660 
caaatagtat ctcaaaattg atcaggttct ggagaatcta ttgaggttgt atggagtaac 246720 
caagtgatgg aagaggttgg aagtgaataa ttttatgtgg tattatgtga ttttttttgg 246780 
tgggggtagt gtctggagct gagcatgatg gagcctagat ggggcagcct tctaatgctc 24 6840 
ccagtatttc gcaaatcata tgtcacgtca aaattgtaat gacaaagctc acagcagctg 24 6900 
cctacgccag gtttggtgcc agctctaatt agagcattga tcatttctct ccccttcatg 24 6960 
aatgcacatg agtgcacatt ttcatatata agggcaaagg acatttcaaa gtttggatgg 247020 
aatctttcct acttagaagt aataaaaaac aatactttga aatcttacca aatgttaaac 247080 
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aaatagcaac aatttttcca tttgaagatg tgtgtcttaa ggtctgttgg atctttcaaa 247140 
tttctatttt ggacatggaa actacagatg aaatttaact tactcaaagt tacacagaac 247200 
tcctggctga tctcttttac ttaattcttc taaaatctaa agtaagaata attttgaaat 247260 
tttattggta atgcttaaat aacaaaagtt actttaatat gttacatgtt tgaagtttca 247320 
tttatactta atatctaatt taacgcttgt gaaaatatta ttcagtattt ttttttcttt 247380 
tctaaaatga cttgttcagg gctccttttg ggattttaag tgagaaaaac tctatttcat 247440 
aatattgtta ccatgctcat ccaataacac ttcatagatg gttagccatc tctaaaatta 247500 
aattttgtag ttgaggagca ttgaaaatat cacctatatt tggatttctg tttaacactt 247560 
caaaatgcat tttatcataa tttaaaattt gatttaaata caaacctctt tatagaaaac 247620 
ctgataaatg tagttttatt tgtttgcttg cttttttttg cagggaggga tattcccttg 247680 
ataccactct gtctttatga ctcatcggtt atttataaat gttatcatcg tttttctgat 247740 
ttttaaataa gtttcagaat taatacattt aactgaataa atcaaccaat aatgtcaaac 247800 
ttaaaatgtg tttgtaacaa gagtcagcac ttataatgaa aaatataatt aattaagagg 247860 
ttayaaaact tataagcaga gtttactctg cttacagaag ggcaaacatt tgttatttga 247920 
tacaacatat taagtattta tcccatgcaa tcactgtcct aggctttgag tttatagtga 247980 
tgatcaaaac gatttacagg tttcaagaac cctgctcaca tacagcttac aatcctatgt 248040 
gggaaaatga taatagtgaa ataattacat taattaatat atacttgatt caaatgtaaa 248100 
taattattct gaaggaaaga cacctacatc aagagagctg gcttaaattc ggggatcaag 248160 
gaagcctttc caaataattg ttgatgggat tgaaagacaa ttgtgggatg agatgaccac 248220 
ctgggcttgg tatgggggag atacaactgt cagtgggaaa gccctggaga tggggaagaa 248280 
gaacttcttc aatggagaga agttcctgag tgagkagaag gcaggtgacc acagtgtgaa 24 8340 
gtgatgctat agaagccaac cacgtggcta tgcaagcaga aggaatttat ccaaaaagaa 248400 
agggggaggc attagaggga tttagtgtgg agagtggtgg tgacaagatc agatttgcaa 248460 
cttgcaactc tctctcaggt tgcagcatgg gggatgatgg gatgagccca gagtggactg 248520 
acgcatttcc agtagtccag gagagtggca tgtggcatga actgtgtgag gaagggggag 248580 
ctggggggaa gaagtggggt aacaacaccc tatttgttgt tttgcttctt acactttgaa 248640 
aaatatttca aagcaggaat gctattccag acatttaaca cctggaacag gaataagtac 248700 
caaccaaata tcgtgggaga ctgacgaatc agaatgaaca tcagctctgc atggcagtat 248760 
ggttttaaaa cactttcatg tctatgtaaa caaagtaaaa atgttaaaag attattttac 248820 
gcctttggtt atcagatagt cagactttta gtagtttata ttttattgca gtctgatgag 248880 
gtctgctttt aatagtagtt ttctcttctt cccccaaccc cattacatta aaatggcaaa 248940 
tgatactttt ccttgttagt aagcaaggcc aagttaggat tatcagaaag tcaataccta 249000 
aacacaaaga atagtgcagc caaacccgga aaatattgtc tgttttgcca agaggaagca 249060 
tctcactgag tatctggaaa gctcattgtt tctcaattat ggttatttta ttagttgcca 24912a 
taactcagct atagtcataa tgtaactgac cagcaccgat gatgaaatca ataaccacta 249180 
caacagagta aatgctgttt ggggtgcaag aaacatcatt tttatctaag tgaaagtaca 24 9240 
tctcatttaa atgtatttca agttgacatt ttgtatatat tgttaataaa cttcaatttt 249300 
actaaaatct ctgcaatatc taatcaatat actgaatgat atgctctccc cttttgtgct 249360 
caaataggat ctgtttatat tttagttaag aaatgctatg ccattttatt atattagaaa 249420 
aatatttact ggtaggaatt tggtttaaac tctccctgtt attcagctgg gaagtatatt 249480 
acaagtaagt ctttaaagac tacaattgga gaccgggcac ggtggctcat gcctgtaatc 249540 
ccagcacttt gggaggccaa agtgggtgga tcaggagttc gataccagcc tggccaacat 24 9600 
ggtgaaaacc ccgtctctac tgaaaataca aaaattagcc aggtgaggtt gcaggcgcct 249660 
gtaatcccag ctactcagga ggctgaggca ggagaatccc ttgaacctgg gaggcagagt 249720 
ttgcagtgag ctgagattgc accattgcac tccagcctgg agaacaagag cgagatttcg 249780 
tctcaanaat aaaaaaaaaa aaaaangaaa aaaaaagact aaaattggag agtttaacgt 249840 
atcaaattta ggagataaat ggtaaataat atgtttgtat atgaaaatat tttaatacaa 249900 
tatttgaaat gaagaaactg agatcacaca actttttgag cgttgagttt atacatttat 249960 
tttctttctt tctttctctt tctttctttc tttttttttt tttttgagat ggagtctcac 250020 
tctgtcaccc aggctggagt gcagtggcac agtctcagct cactgcaacc tctgtctcct 250080 
aggttcaagt gattctcctg cctcagcctc ccgagtagct gggactacag gtgtgtgcca 250140 
ccacgcctgg ctagcntttt tgtatttttt tagtagagat agggtttcna ctgtgttagc 250200 
caggatggtc ttgatctcct gacctcgtga tctgcccatc tcggcctccc aaagtgctgg 250260 
gacgtgagcc actgcgccca gcctattttc tttcttttac ctatatcatt tcattttttt 250320 
attgatacgg aatattttac atatttatgg agtacatgtg atattttgtt acatgcatag 250380 
aaggtgtaat tatcatcacg gttttgggga tatatatccc cttcagtatt tagcattttt 250440 
atgttttaag aacatttcac attctctctt ctcgctactt tgaaatatac aatacattgt 250500 
tgttaactat agtcactcta ttctgccatt gaacattaga acttaaaata ctttccagcc 250560 
aggtgtggtg gctcacacct gtaatcccag cactttggga ggccgaggtg ggcgggtcac 250620 
ctgaggtcag gcgttcaaaa ccagcctggc caacatggtg aaatgtttgt atttgtaaaa 250680 
ctacaaaaat cagctgggcg tggtggtggg cacctgtaat cctaactact caggaggctg 250740 
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^ca^ccao cSf^ 9 " CCtgggaggt g^tgttgta gtgagcccag attgggtcat 250800 
cctgggtgac aagagcaaaa gtctgtctca nagaaaaaaa aaaaaaaaaa 250860 
rcaSctcS rrT a a f ctatgtt tgcacccatt gatcatcttt tcttcattcc 250920 
atgaSaS cltlt^ " CCCagcct ctggcatcta tcattctact gtctacctac 250980 
S?c?wc« tc « at f at gagtgataac atgtggtgtt tgtctttctg 25104 0 

cSKtca a^SS* acat ^gac ctccagttcc atttatgttg ctgcaaatga 251100 
tnttnl t a * gntttttg tgcacaatta gtattccatt ttgcttatct atttgttaat 251160 
tgatggagat ataggttgat gctatatctt tgcaatggtg aatagtgcta cagtaaatat 251220 
gccagtgctg gtctcccttt ggaacaccga tttattttcc tttggataaa tacctggtag 251280 
tcgaatttct ggatcatatg gtacttctat ttttaatttt gctcagaaat ctccatagta 251340 
tttgttcata atggctgtac taatttacat ttccaccaac agtgtataaa agctcctttt 251400 
ttTnlr f CaC " gta tctgtcattt ttttgttttt tttagtaata gccattctac 251460 

tggggtaagg tgatatcaca tagtggtttt gattcacaat tccctgatag ttagtgatqt 251520 
tgagcatctt ttttatatac ttggtggtta tgccatttgt atgccttttt ttgagaaa^g 251580 
tctactcatg tcctctcatg tcctttgcct actttttaat aggattattt gtgagatttg 251640 
tgaggttgtt ttttgtttgt ttgtttgttt gtttgttttc ttacagttga gcLJttgag 251700 
tcccttgtat tttctggata ttagtcacct gtcagatgaa tagcttgcaa atattttctc 251760 
ccattcaaca ggttgtttgg agattctctt taaaatagct gcaacaaaca aacaaacaaa 251820 
caaacccggc caagcacggt ggctcatgcc tgtaatccca gtacttaggg aggctgaggc 251880 
gggcggatca cgacgtcggg agatagagac catcctagct aacacagtga aaccccgtat 251940 
tlnnnttT? tgcaaaaata "agcctggc gtggtggcgg gtgcctgtag tcccagctac 252000 
tcgggaagct gaggcaggag aatggcgtga acccgggagg cggagcttgc agtgagccga 252060 
gattgtgcca ctgcactgca gcctgggtga cagagcgaga ctccatccaa aaaaaaaaaa 252120 
aaaaaaaaac cctaggaata aatgtaacta aggaggtaaa atatacctac aagttgcact 252180 
aaaacacact gacaaaagaa attgaagagg gtgcaaacaa atggaaagat atcctatgct 25224 0 
catggatgaa gagacttaat atgattaaaa tgatcgtatc atccgaagca atccacacat 252300 
■ tcaatgtaat ccctatcgaa ataacattat tttttcacag aaaaaaattt aaaaatcctg 252360 
aagtttgtat agaaccaaaa aagagcgcaa atagccaaaa ctatactgag caaaaaggac 252420 
aaagctttag gcatcacact acctaacttc aaaatacaag gctatagtaa caaaaatagc 252480 
atggtgttgg tataaaaaga cacatagaga ccaatggaac aaaacagaga atctagaaat 252540 
aaaaccccat atttacagct agctgatttt tgacaaaaat gccaagaaca tacattgggg 252600 
aaaggatatc ctctttaaaa aatgtttgca ggaaaattgg atttccatat gcagaaaaat 252660 
gaaactggac ccctacctct caccatacac acaattcaac tcaacatgga ctaaagactt 252720 
aaacgtaaga cctgaagcta taaaaccact agaaaaaaac agggaagaca cttcaggact 252780 
ttgttctagg caaacatttt atgggggaga cctcaaaagc acagaggaca aaaacaaaaa 252840 
taaataaatg gggctgtatt aaactacaaa gctgcagagc aaaggaaagt attttcttaa 252900 
tccttactgg ttcttaaaaa gttttggatg ttgaccaaca catetattat gataattaaa 252960 
i^" tg9 ^ ! tgt Ca ! gCt ^"S^ata tgcgcaggtt gcccccttta ggaagagcac 253020 
TzlT^ t \ ctaactc V ta tttttctatt tcttaaacat agctgactat aaaattgcaa 253080 
agaaatttgt gtatgagcta gggaagatgt tgggttgggt tgagattcaa ggaaggagtt 253140 
ggactttaaa atggtggaat caccagtaga ggacaacacc gtgtaccaaa aacgttcaca 253200 
ggacaagcca ggttgtacac catgatatat gagttgtgat gctgatgaag gacatagtaa 253260 
?^!ff aa 5 a a ^^ caaa aaaaaaaaaa agaaaaaaag aaaagcatta atattaggca 253320 
atgtattttg gtgcctataa tttatattct tatcaatacg gatagctcca tattgatctg 253380 
tttatatata tctatttgtt tatatcattt gattaaaata tggcattgac cattggaaag 253440 
gtacatctca aattatcctg gttaaatgtg aaagtatgaa gaggctgtga ttataagcat 253500 
ttaagtttca atcaagtagc aagtgctgtc aagatggtat atccaatcag atattgtgag 253560 
?? a "'^ ga "9 c tataga ttattctgct gtcttaagag ataggctgtg gatattcaga 253620 
ttgtgatttt ggagtttaag agggcaaagt aaattgcctg ctgaactcac agtcatgttt 253680 
gagagagact gacaggaaaa ggtaaaaatg aatcatcatc tacaactggc tggattggac 253740 
ctgacggaat aagatccagc aggaaaggtg tgtgactacc attcctggaa ctgggttttg 253800 
ctgaatcatt ggacagtttt gctgaatcat tgagcataca agttttgagg gatacatgtg 253860 
catttagaaa gtaacgaata agaatattca taattatctg ctacttttca tagcccatgt 253920 
attaacacca aagcaatttt agatgagcta tcaataaaaa tattaaaaaa gaaatcacag 253980 
tctatttatt taacacacat tggttatagt caacaaggta atgtgttgtg ctgcaataat 254040 
acaagtaaac atccgtctaa gaatccattg tttagtaggt attactgtac gaatttttaa 254100 
caaagctgct ttaaacacta taccaacagt tttaaataca aaagcccagt ggtcaagctc 254160 
taatataaat gaataaatgt ggccttctgg aagaccctag agagacaaca ggacacagag 254220 
tggagcacag gaaagaaaga agtccacttc ctgcataaat ggcggcggca actctgtttc 254280 
acctgacaga cgcctcctcg aaatgcagtc ccatttaaaa cattttttga ttgttttaga 254340 
taaaacttat aacattgttg aagaagcaac taattctaga tatttttata attttaagct 254400 
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gccagaaaaa gtaagtccac aattggaaat ggcccaaggc ccccaggatg taccttacct 254 460 
atgccaatat tgagaagatt attccagtgc aaaaggtcca tgtaactttc attaacacta 254520 
tggtttgttg gaaacacact aaaccactgt tccataattc cttttttttt ccagttactc 254580 
aaagcaaaaa taccagaatt ttatgtttac aaatgtgagt ttttaataga aacatgcaca 254640 
gtgtatactt tatatattgc cggaattgac ttatcgaatg ataatttaac aagattgttc 254700 
atttgaagtt tatgcccact tttataaata aaagaatcat tttggttgtc ttcctctctt 254760 
tctctttatt tttcctttct tatttcagtc tacaactact cccccttctt ctggcccatg 254820 
tggagatatc ccatttccca acctgctccc tgttcagtta caaggaggga gggaagacat 254880 
ctggcagagg aagattccca gggaactatg aaaggagata gtgactactg cagagcttgt 254940 
tacaataaca gactattaag gctggtgcag tggctcacgc ctataatccc agcacttcag 255000 
gaggccgagg agtgtggatc acctgaggtc aggagttcga gaccagcctg gtcaacatgg 255060 
tgaaaccccg tctctactaa aaatacaaaa 1 attagccggg tgtggtggtg agcatctgta 255120 
atcccagcta cttggaggac tgaggcagga gaatcacttg aattcaggag gcacaggttg 255180 
cagtgagcca agattgtgcc actgcactcc agcctgggcg acaagaggga aactccgtct 255240 
caaaaaaaaa aaaaaaaaaa accacaaaaa acaaacaaca acaacaacaa aacagactat 255300 
tagcactttc tcatatccct tgtttgctta agaatagtat ttgaataata taacggccta 255360 
tattttatct acctactctc tgacttcctc tggtttggag gggggtgaat catcagaggt 255420 
gttgttgcac agtatgtgtg tgggtaaata gggtcccttt tctctactca tcttttctat 255480 
gcaaaataaa gccattggtt ataatgaaga ggaaatcaga ggggaagagg tctattttga 255540 
atttttgtaa ccgaaaaagc tcatgatttt atgtgaatat tccattggct gcttagcttg 255600 
cctacttact tgaatttgga gtttattccc aacactgcag cttacatgca ccaaggtttg 255660 
ggaacattct gtaaaatttc cttgaggaca aatatccgct ttgttgtatt ctttgttatt 255720 
tatccatttt gccaaattat ctgcaagtag aaatatcgaa ataagaagct ctttagcaat 255780 
ttactttgga tattggtttt cttttgaagg acagttatta aaatagcttg taggattact 255840 
cattttcrtt tttcttcttt ttaaatataa agcaatgtca tcactttttt ccctgtatta 255900 
tatttctcct caataattga tatgctacat taaaggaaca caaaatggtc ttaattatgc 255960 
aataatgatc aaggcaaaga gtgtttcctg ggaactaatg gttgcctgag aggaggtgat 256020 
ggcttgaggt ccagctggtt attaagccgc aggaaatgct gcaggccaag atttgtatta 256080 
tttctctgag atgaaaatga acccaaaaaa aggcaaaatg ggtttttctc cactaatggg 256140 
taaaatgaac tcattttgga tgcagaagac ctttatgtag aaggaatgta ttccataaaa 256200 
gcaaaatcaa aatgttcagt ggtctctgct ttttgtttaa aatatagtat agcacaaata 256260 
atagtccatt attgatttta aatcttgatg caactaaaaa tatcatgcaa cacttgagtt 256320 
tttgtctagt tctagaactt ccatagaata ttatgtccaa caattgtttt gatagctttt 256380 
gtaagctaag aatatatttt gtatgcattt aacattgaaa atggttactt ctgaaaatat 256440 
tcagaaaaca gctatctcat aacaaaagaa tacaggcatt attttctttt ttttctggag 256500 
ttcattcaaa aaatattggg gaaaggtgtc tcattaaaag atgcgttgag atttgtggtc 256560 
tctctgcatg attagagatg aatcacttaa ccttcttaaa gaaactgcct aatcaatttg 256620 
taaagtagtt agtatagctg cccccatctc tgatttgtag taggattatg caaagcagag 256680 
tattttgtag atattaatat tactaatata aaaatgtagt tattactata tttactttac 256740 
catcttaacc aagtgaaaca atatatttta tcatattttg ccatttttgc cacccctgtg 256800 
acatggggta tgtcacaggg catagtagac aactgaaaga gctcctaata gtcaaaactg 256860 
gaaaagtttc agaacaaaat taaccagtag tattagatta taacccaaaa tacaaaataa 256920 
atgtttctga attcattttg atttaaatga atgacagaat acataagtca atgggggaga 256980 
gtttattctt gcatgcagaa gaatccagat aatttatgtg gatgccctgc cttcaaagag 257040 
gcagagccta actctcattc ctggacagca cagagtgact tccttccaaa gagtacagga 257100 
ccgaaagggg tgaccaaaag attgaagaaa gatcaagcag ttatcatgag taaagcaaac 257160 
cttgacaagt ttatcgcaat gtttagaatt ttgttatatt caatcgagga ctttacaaaa 257220 
ttgtgttaaa ctttatgatg gatatgggtg tccttcttaa tagagaaagt tcaaaagccg 257280 
ggtgctttgg aggaaacatt atcttttgtc tgccactata aatgtgacta aaatagtacg 257340 
taataagcat atggccacaa acttaaatgc attatgttgc caagaaaata tgttggcagg 257400 
tgtgtagcag gccttgcaga agatttgtag gaaaaatcat aaaacaaagt acatagtgtg 257460 
agatgtttgc tatacatgag gattttttaa aaagcagatc tagtagaata tgatttactt 257520 
agagaagatt tgtatctctt gtgttcagct «gaattatagc atctaacaca gcctgtgcca 257580 
cacagtagga actcactcaa tttttgttga aattattcaa cgaatatttt attttaggct 257640 
tttgcaagat atgactttgg taataaatgt agcacactta attcattgaa cactaaagga 257700 
aacaaatgct aaagaaacta gtttccaggc acaaatgact tgcttaatag cacaaggcac 257760 
atggtgggga ggggaggtaa ctacaagatg ccaccctgaa ggagtcactg tttcatctgg 257820 
agcactagag atagtgggtg ctattgatgt tatctagcaa gactgagggg ccaactttct 257880 
gagttcaaac cgtggttttt ccacctgtaa atgaggaaca acgataggac tggcctcaga 257940 
gataacccat gaggactaag ttggttaatg tagagtactt agatagctgc ctaaaactat 258000 
gtaatcatta tgtaaagata agctattatt aattttaatg gtcataattt aaatgaacat 258060 
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tctaggatat agttgcagac attgcaatac 
ctatccagga gagaaattgc aacacagatg 
tattatggtt attaattcaa gacctataaa 
tgtccatatt taaaaacttg cttaaaattc 
gtctcttgtt aaaaaaataa caaatcagtg 
tgaaaagttg atagccaagt aacaaatgaa 
tgcaatggaa aaattaacac agggatttta 
taattcttat tttgacaata tttaaagcca 
atccttaaat tgctgtatct cagggtacta 
tttcactcac ttgggcttta ttgaagatat 
tattataatg ttgtttctga agaacatacc 
tttatcattt agatttctac ttttagtgta 
aatgacatat acaaaaagtt gaagatgtgt 
atcaaacata tttattattt atcttaggtg 
taaactatat attactttta cttgtcacca 
aatcttttca aattgtgtgg cttcaacttc 
gaccatattc tatcaaggac acaaatactg 
cacagggaag gccttagcct gccttttgga 
tgtacgataa gtagaagagg aagaaataca 
cttattgacc ttaacttcct caatcaacta 
atttttcctt ttttcttatt tatttgtttt 
tttgtcacag atagatttta acagcttgca 
cttctattaa aagattggtg taaagaaatt 
cagaactttg ggaggccgag gtgggcggat 
ctaacacggt gaaaccgcgt ctctactaaa 
cgcgcctgta gtcccagcta ttccggaggc 
tgcagtgagc tgagattggg ccactgcact 
tcaaaaaagt aaataggata attatattct 
ctgacatgaa taagggcaga gacggtagca 
ttcaaggacc tttcattcga gcataggaag 
aggccaaatg tttgaaatcg tgaaagcttc 
gcacaagtga tctacagtga ctttgtggga 
acagtctata acttgtattt gtttcttttc 
gcctatgtct gctatctcct qqaccctccc 
ctacagcctt tcactgtaat aaaccattac 
gagtcctttt agctttcact gaatctaata 
aaaaagaggc tctagagttc gtcagctggg 
gaaatgctga cttccttaca aaaactaaaa 
ctagtttgaa taacacataa caagttttga 
tcatggattg gaggacagtg gcagggccac 
ccagtttaca gacatcagga agttgccact 
aaattcttcc agttaaaaat ttatttatgt 
cccgtggaat attttaagtc tcacagaaaa 
aaaatcagat tatttttctt caaacatcag 
ttatagtgat tgatggagag aagttatttt 
taccagagca gacagctatg agctgtcaga 
ggaccttgaa agctgggaga gaaaggcaca 
cactttcagt ttcactacta gctgccccta 
taaccgattt ttaacccgcg ctgagtcttc 
caatcacgtg tctacactag atcaagtgca 
gtagggtgta aactgtcact aaggaatgtt 
tcactgtact tgaggcagta ttggagaact 
ctggaaaaga gacggctttg tttcctttca 
tctcctctgt ttttctatag aacccagttc 
gtattattaa tgatcaattc atctactcct 
ctccttcatt ctccactatc tacacagact 
aatgccttct tcatatatct tgactagtct 
ctttttcata agagttgtgg attagaaaac 
ttgtttgttt tccttaaaaa attaatactt 
tcccagcact ttgggaggct gaggcgggcg 
tggctaacat ggtgaaaccc cgtctctact 



accaaagtca tttatgttat aatttatatg 258120 
tgcaggcacc ccactatcac cccaaattaa 258180 
atagcatgtg tcatttataa aactctcaaa 258240 
ttaaaaatca aaaagttaat gggttttgaa 258300 
taaatgaatg acagtttaat atcaaaaaga 258360 
attgttaagc agggttgaaa aattaccaag 258420 
tatttgtaat tattgattaa cttcttctat 258480 
aatattggca tatattaaac tgagaaccga 258540 
aaccaaaatt ttctaaaacg ttgaaacaaa 258600 
tattgcttta tcctttagtt agaatgatgt 258660 
ataattaatt aatgttaatt ccccatttct 258720 
tttcatgaag gacatgaaac attactattg 258780 
taatgttctt ttattaggct gatggaagta 258840 
aagtttaaat aataaagtct tgtatactat 258900 
tcttttctct aagcctggtt tagaggtaaa 258960 
caacttgctc accttctctg ggtaagagtt 259020 
tattccttct tcccagggca tttatatagt 259080 
cttccctatg ctttaagatt tttttcttta 259140 
tcaaaagaag tttcgctgca attcaactac 259200 
aaagtatctt tctacatcat tcatttcagc 259260 
tcttttttct tttctatcat ccatttgact 259320 
ttgtatgttc attttcctaa aatattagag 259380 
aggggtccag gcggctcatg cctataatcc 259440 
cacgaggtca gaagattgag accatccttg 259500 
tatagaaaaa tttagccggg cgtggtggca 259560 
tgaggcagga gaatcgcttg aaccaggagt 259620 
ccagcctggg caacagagca agactccatc 259680 
tatattcatt aacatatttt gatggatccc 259740 
agttaagatg ggccagttgg atttcatgcc 259800 
aaaaataaga gtatacatat ttataataca 259860 
cttctgtatc tcaaagagca atcaactctg 259920 
gcagttggca tctgaaaaac atcttaaaaa 259980 
acttacatgt tatgaaagaa gagaaataga 260040 
ctagtacatt tttccattgc taactttart 260100 
taagagtgtg ttagtgtttc taaattcttt 260160 
tttgtgttag gaccccccca cacaccccta 260220 
acattcaagt ggccaggccc aatatattca 260280 
gctttataat tttcaaaaaa aaccccaaac 260340 
gtgaacgctt agcactgtct caccatgagc 260400 
acacgtgccc tccctgtagc caaatgtagg 2604 60 
gcgatgcagt cattcttgac ctctgtgaag 260520 
ctacaagcat cctctttcaa atgcaaccac 260580 
gtcaccttta ttttagcatg aaatggctta 260640 
atataatttc cattaaggaa atagaagtct 260700 
ggcataaaag gaaaatgtgc ttgggtggaa 260760 
gagaagaact tgtggggagc agaggagagg 260820 
aaaacagcaa tgcaagaagt atattttaac 260880 
cagagtgatt gtgattaaag aaactcttac 260940 
catgtatctt agcttgattt tttttgccct 261000 
ccatctacca aatcaaacac tcgtttttaa 261060 
tgggtcaatg tccacctata gatactgctc 261120 
tatgttttct atggaagact tagagcatta 261180 
tattaaatgt caaggagagg cggtaaagcc 261240 
agggctgtgt ctcatcactt actatcctgt 261300 
caccagactg caaagttttt gaggaaagaa 261360 
acgtataaat taacaatgaa taaatgaaca 261420 
atatgttgtt agcattatat ttaaatacca 261480 
tgtgaaggga ttgtttgttt gtttgtttgt 261540 
gcggccgggc gcggcgggtc acgcctgtaa 261600 
gatcacgagg tcaggagatc gagaccatcc 261660 
aaaaatacaa aaaaacatta gccgggcgtg 261720 
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gtggtgggca cctatagtcc cagctactcg ggaggctgag acaggagaat gacgtgaacc 261780 
cgggaggtgg aggttgcagt gagccgagat cacgccactg cactccaccc tgggtgacag 261840 
accgagactc cgtctcacaa caacaacaaa aaataataat aatacttgga caggaagacc 261900 
acagctatca cttctagatc taaacattct tcaagatgag tggagacaga ggtagagtgt 261960 
gtgctggatg gatataggag tggacttact tggttgtcat catacggttg taccatttct 262020 
ttttaataat gactgaagat gacattggga ataaataaac atagaccttt gagtttcata 262080 
gccacttaac tccctagttt tgtttttcaa attagttgag aggtgatact tatagccata 262140 
actttaataa cattaccatc atcatctcaa ttgtggcagc tacaatgtca agaaactcac 262200 
aaagttaaaa tgtgaagaaa ctttagaggt ggtttggtgt ggagaagatt gatgttgata 262260 
cgattcatgt atctttcagg attccatggg attgcactct gtcatcaatt tgctctaagg 262320 
actaagtctg aaaaaactta ttgtgtgtca ttactgaaat actcagcagt ggtttggttt 262380 
ctgaatactt ttaaagcatt ctctctcttc tccgcattat accataaaat atatcttagc 262440 
atattattgt tttcacttgc catttgttta atatatgaga atatattaaa gcatatgcct 262500 
tgaatcaaat gctaaccaga ttaaactata tctaagagag tgtgtctttt caaaacagat 262560 
tgtgaagttg tgttctggtt cccatattac tgccattgtt caatattttg ggacttctat 262620 
tttggaaatg cattcagttc caacttgtag cccctaagaa atgctcccta catctaaagg 262680 
ctaagatgcc ctcctttatt atttttctga tcaaatattt aagaaataag cttcaaatat 262740 
ttgttaaatt tttctcaaaa gtaaaatttt tatgtttaaa atatgacaat ctaattgtca 262800 
tctttaatgt attaaaatct tttatatcca gtttttattt tattttattt acttatttat 262860 
ttatttgaga tagggtctca ctctgtcacc caggctggaa tgcagtggca caatcttggt 262920 
tcactgcaac ctctgcctcc taggcccaag tgatccttcc accttagcct cccgagtaac 262980 
tgggactaca ggcccacatc actgcgcctg gctaattttt atatttttag tagagacggg 263040 
gtttagccat gttggccagg ctcgtcttga actcctgacc tcagttgatc caccctcctc 263100 
agcctcccaa agtgctggga ttacaggtgt gagccatata ttcagttttt agaagatcat 263160 
tagaattcaa atgttgccag actttaagtg tgggtcacta tctgatggag ggcttctctt 263220 
agaagatgtc cccaatggca gggccttgtt ataaaggttg gaagagagat aatcctctga 263280 
ctaagaggcc agagaagcag aagatttttt ttcatcagat ttcacgtcag taggaagaca 263340 
gaaaggagaa ggaagagaag gcatgaacag catatgtgtg aagtagaatc ttaatagtgt 263400 
gcatctttta agtcaggatt ccgtctactg ttggccaaaa taaatcatta ttatctggtg 263460 
gttggaaaaa ataaggacat tccttggttt tctccaaaaa cttattttgg tgtttcttaa 263520 
ttacccttag gagaccttaa tgatcaacac tgtatcaaca gagaaagtgg aattttattc 263580 
tcttatagaa cagtgatgtt attatgcaat gtattcacat aattatttac agttcttcag 263640 
ttctagatga ctgatactac agaaacctga gataatccct gcttccatta gtttgggtgg 263700 
aacttttgtt gagagacttg agtcatactg aataccttga attactgcca agagctggtg 263760 
atcatcaagc ttgagttaat atcaatgtta tgttgctatg gagaggataa ctgcaggaga 263820 
acaatcatca tgtggtaggt tttaccacgt ggcctgctga actccatgct caaactccaa 263880 
ctacttttaa gtaatgacat taaaaatcat attttccatt tatctgtgga agctagtgag 263940 
taaagaataa gatagagtct actgaaccta tagaatgtta ctaagtatta ggtgttgtat 264000 
ttaatttcca gtttgtctaa tgctggaggc aattttacct tctcctacca gtgaaaaagt 264060 
ttatgccagt aagacttgtg atttttgttc tcccccaaca catgcttcta tataaattta 264120 
ttatgggaaa agtgatatta ataaaaactt tctggaacaa ttatagatag agtgtggtct 264180 
ggtgtgacag atattttgtt tcttcacaac attcattctc tgagtattca ccatggatca 264240 
gatattctgt agtttctgta gtagatacaa aagctttaca aggtgtattt ttccacaaag 264300 
aggttagcta gattaatagt gagaggatac atcaaagaac atggaaatga tttttagaga 264360 
aatgagagca aatactaaag tattgaatta agtaaaacaa gaaaaatagc tcctcccacc 264420 
ttcaaaaaca ttggtgagac caggattgtg gagggcacct tggcaatggg ccttgtagct 264480 
cagtgaatgg ataaaatgtg tacacagtga gaagagaaga aagtgtctac acagaagtta 264540 
gtttttgttt aatttgatat tttctcttta cctcgttcta gtcaagatga cctgaagcct 264600 
tctagtgaag aaccagccag gcaagttcca cttcttaggg ccgttaaatc ccttctctta 264660 
ttaatgatgt taattgcatt tggatctcca tccagatatt tgccctacct taatttcatg 264720 
ttaatgtgcc atgggctaaa gaggcagtat tttttaaacg catctgtaag tttcatttat 264780 
gacagaattt ctgtaataaa agtgtagtca agagtgttga tggaaaatta gggtgacata 264840 
aatacgtgag aactggaaaa tgtataatct taaacagtac acattatttt tattatctct 264900 
ctttcccaca ggctgatgag gcctgggata ctaatatnaa aaaccaaaat tagaaacttc 264960 
tcctctgtgg tgtttattgc ttcgcctgca gtgcatgaca tcttaattct aaagctctna 265020 
actcacaaaa tctgacatcg ttttaccctc tgacacgtca aaatcagggc ctaattgntc 265080 
ctgtggttgt gaattatgaa caaatatact ttctaagaag atggattttt catttaattt 265140 
tccttttcag cttcacaata tgtacatata cagaggttgc ctgcgatttt aatatgctca 265200 
cacctacaaa aacacacaat acacacaaac tttcaaacgt aagcacacag atctgtaaac 265260 
atacacatga agatttgttg caacacaatt cattctcaaa tctatttcct gttatgctta 265320 
tgattaagat ctataacttc tcagggttga tttatttgtt ttgtacgaga ctgagtaaag 265380 
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ctaaagacca cgaatatgat gggtgttgct 
taatctgcca tatttcaaat attccctgtt 
tataaaaata gaaatacatt tgtgactttc 
aggttattta gaggactcat gtaagacaga 
cacagttatg tggagaaagt aatctgccac 
tggctccgtc ctcgatcact tttatatccc 
aagcagaaat ctctgtactc ttttgacatc 
catacatgtt ttatacagat gtgggcttgt 
tggaccttat catgctattt tagttatcag 
cttctctcta aaaaatatac aaaaatcaat 
tacctttttt aggccttgat ctcacagcct 
gctgacagag catccaccat ccaaaggcag 
ttaattatga cataatgaca ttaataactt 
aataattcca aacttattat tataaccata 
gtaggacctt aatttatagt ttaaccattc 
caaatgtata catagtttta ggggtccatt 
tttctctgta ccttctacta agtaacctgg 
tatacataca tctgtctctg aggactttta 
ttgctgttgt aacagactac tgctactaat 
ttacttttca gtatttctaa agagcttttt 
taaaacattt tatatagaga ttcatataca 
gtatcttaag tgatcttgaa atcaacatta 
ctgatctttg ctcatttgca ggagcacctt 
catttgggag aaattcacta tacaacaatc 
tgtttaaaat atctatttct ctattcaaac 
acacatttcc aaataagtaa aaacaaaatt 
tttgtttcct gtcttctatg ttttgattat 
ttagttttga accataaaaa taactcattc 
taggcatata gttgtcagta atctgatttt 
agggtctctc tctgttgccc aggctggagt 
ttaacttccc aggctcaagc gatcgtccca 
actcctggca aatttttgta ttttttgtag 
gagtcaagat gtagatggca cattagttca 
tactttgtgc catgatataa gcaaggtgct 
tccaaaccgt acacttacaa gtgaggcaac 
ggaaagcgtg tgtgtgtgtg tgtgtgtgtg 
attttcctac gagtatccta gttgatcctc 
tgctgtgtcc tttccacccc acccttccag 
tgtaatgaag cctgagatgt tgttagaatt 
tttcaatcca caaaacgaaa attagatctt 
ttgagaaacc ctgacaccag atctattaat 
agtttgctac tatttggtga tgacaccacc 
aagatgttaa aggcatcatt ctcttatttc 
cacacacaca cacacggttt ccatagtttt 
aacattataa ttattattaa ctcctaggaa 
taatttaagt agacacattt aatgatgcaa 
tatacttact cagaataaat aatgggaatt 
tgatcttatt ttgctttggt tttagctttc 
tagattcacc tggaggtctt atttcataat 
attaagactg gggtgggctc aagaatttgc 
tttattctta gtagatcttc tcattaaaaa 
aagtcaaaat ctctatcgca ggattcaaca 
catagctttg ggacctaaca ttactcttgt 
gaaatgtggt ctgttctctg aatataccat 
ctgtattttc tggccagaaa gcccttccac 
ttcaaggcca agttccaata aagtttcttc 
agtatataaa attgaggatt gaggaaatcc 
aatctaagat cattctgtta atgttccacg 
gactcaagtt gaattttgaa tcacaaagtg 
ttttatacca tgatttttat gtaatacatc 
ttttaataat gtgccaattc acaaaagaat 



aaaataaact tctcaagaga catcaagttt 265440 
gaaatataaa cataaaggaa ccattaatta 265500 
agagctgtca ccgaatagat attggaaggc 265560 
aaagaactac gcacagatca ccgatgatga 265620 
aggtcaccca gacagggagt gggaaagtga 265680 
tttcattcca aagccaagca tgttctataa 265740 
ctaactcctt ctgctaccct catttaccct 265800 
cagaaaaatc acaaaatttt aatgttagaa 265860 
ctgctgagct gttttctatg tttgagttcc 265920 
agaactttca gtgcacttgc aggtgggttt 265980 
gatccttaac tcatctactt tccaatactg 266040 
cgtagacaaa ggttcaatga gtaggaactc 266100 
ttaccttatg ttttatgttt ttggagacaa 266160 
aacataaata t'ttgtatcat taataacagt 266220 
acatttgagt ctttaatccc tgctcaattc 266280 
tttatttctc tccacatgta gtgatttatg 266340 
ccttattcca ttatatattt agtttccata 266400 
atttcatttc attggtcaat gtatctgttc 266460 
actactgctg aattactatt attattatta 266520 
aaaaatatgt cattaagtta ctgttttaag 266580 
tattgaactt atgtttttat tttcttgcat 266640 
taacaacctg ataaaataga gtgaaggttt 266700 
atataagatg taaattaggt aaacgtttaa 266760 
tgattctgag actccaaaga tcgtttttta 266820 
tttcatttta tttgccgcat ctaatatctg 266880 
caccaaatct tgtatctttt aaaatgtatc 266940 
ttcttttaaa tttttaaaat aaatttgtgt 267000 
aaagtcatca aagtaaaagt aatcatctct 267060 
aacccaacag actatataca tatttaagac 267120 
gcaatggcac aatcatggct cactacagcc 267180 
agtagctggg gactacaggt gcatgcttcc 267240 
atgtggggtt ttgccataag acgaattgtt 267300 
cgttatttca gttaaggttt cctgagcctc 267360 
aagtctgtgc tgcaaacagc agtcatccta 267420 
aaaggtatat aggggttcgg gaaaaactac 267480 
tgtgtgtatc aggcagaaaa ttatcctcac 267540 
tgccaccatg ctctttccat gggtgccctt 267600 
agggatctgg atgtggatgc ctatgctgag 267660 
taatgaataa ggccaagtat gctaacattt 267720 
gacctacaca aaatgtggct agctcctctg 267780 
acctttccta tggtcacttt atgttttgaa 267840 
aactaggatg aatataaatg ttcattttca 267900 
taataattaa ataataaatg tggagataca 267960 
tttttcaggc aaattcgcta aaatctcata 268020 
taatatctac atttaaggaa agagtaattt 268080 
gtatggatac ctcatgcaaa caaatgcctt 268140 
ccacatggaa gcttatgaga aggtctttaa 268200 
tgagtaaaca agcccctaca cactgagtat 268260 
gtatccaggg ctttagccct agacatgctg 268320 
atttttcaca aattccaagg tgatattgat 268380 
taagtgtaac cctctgcaat taacagaata 268440 
gagggatgac aatctggccc aaccaccttg 268500 
tttacagcag tagttctgcc attcgtaagt 268560 
tctctttggt atctttgcac tttgatcttg 268620 
ctttacagtt tagaaactgg ccacttaccc 268680 
taagaagagt gctgcctatt tccaaattta 268740 
caaaattcca gtacatttgt ttcataatgc 268800 
tgcttccaac ggctccacga agtggtatga 268860 
tattcattgt ccacctgaca tcttttttta 268920 
atttgtgtct ctaagagttt ttcacgtgga 268980 
ttgacttaac acttctgagc attattcatt 269040 
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gatacacagt tgaaatacag atagtttaga 
atttttttta aagctcaaca tttcctagag 
ctctcttatt aaaaaatgtg ttaatttaaa 
tgtgtgtgtt ctgtgtgtgt ttgtgtgtgt 
cctaagttta caagattcta aaccttttca 
ttttggtaga gaaatcttaa tgaagcttaa 
ggttaatctg cttcaaatga tggtggatgc 
gctgcctata gtctccctta aaggttaagg 
tcaggctact gtacatcaga gcactgacaa 
aggacctatt cagaaatgtt ggtttataga 
acagcatggc ttaagtgagg aatttttact 
aatgcctatc ttaactcttg agatgtggcc 
ggttttctca gacatgtgaa tttggggaga 
tgtggcactt gtcagattaa caaaaggagt 
tgatagagtt tcaagttgag aacagtataa 
gggtaagaat aaaaggtgtg tgagccatag 
agcaatagga gctaaaggca ctctaaaagc 
tcttttgttt tcattttaac tttctgcttt 
atttaatggc aagttgcctt taaaatgtat 
tgattttgca taaataatca cagagaatgg 
taattcccat caagagcgtg aaagctacaa 
tcatccagga agtctctttt gttatctaat 
tttttttttt tttttttttt tttttttttt 
ggagtgcagt ggtgtaatct tggctcactg 
tcttgcctca gcctcttgag tagctggatt 
tttttgtatt tttagtagag acggggtttc 
tgacctcata atccaccacc cccctcagcc 
cgctgtgccc agcccttgtt ttgttaggtg 
agtggtggtg aagttgtatt atgttttgaa 
gattaacatg tttttaatca tttatgcact 
tgtaaagaaa gaatacactg catgtaacac 
gcagaagata tttaaacagc ctggattaat 
gactgaatga gttacaaata aacaagttcc 
tgtctcataa ttgtatatgt attgcattta 
acagttataa cgtgcatgag caaaactata 
ggtgaaacag ccactctagt tatgtaaagt 
taggacgaca ttgacatcag catcagttat 
cttaatcaag agagacatga cacctggatt 
aggtggattt ggtgaagtcc aggtggggaa 
gcaatctgct cacagatcca caatattcgg 
tctgtgatgg aatatcaatg cttccatttt 
taaaattaaa aaaatatata tatagccagt 
taaaaatgca aagcatagtt tctgtctttg 
aaatgtagaa gaagcaatta taagcaggat 
tgttgtacaa atcaattctt gtgcctaggc 
tccacattcc aagtaaattt actaattcaa 
cagaaaataa gagtgaaaaa tttcaagaat 
gaaggcttgg attagcaagc atttgattga 
tggagaggtg gggaggagtg agggttatta 
aacacaggtg aaaaatgagc atgaggggat 
ctggaaatgg aagttttgca tggagagcat 
gaaataacag ggctgggttt gaagctgttg 
gactcagaga aaaggggtgt ttggaaaatg 
gcaagattaa actggtagct atcatttcca 
attaaacaaa accaaatgct aacatcagca 
cctgaaattg caaagatctg gtagaataga 
gttgcttctg ctttccatgg ctctccttat 
ctccccattg tcagcctgga acagaaaggt 
catagatgga caagtgctfct aggaaactac 
tctcatttag aacaaacatg gaaaggaagg 
atcaatatac taataattga gactgagtga 



taggaggaaa agtcagtata aaggaatgaa 269100 
attttacttc tcatatctaa tggtgtaatt 269160 
ctattataat tttatcagat gctgatttcg 269220 
gtgagagaga gagatgatct gatgtgtatt 269280 
cttataaata gatagttcag agggtgacat 269340 
aatgcaagtt tctgtgtaga ggtaccatct 269400 
accatcatta cccaaatggg tgaaagcctg 269460 
agtgatatat gggtgctaaa gaaaggctta 269520 
aacttcctca ggaaatcacc aaatgaaaca 269580 
tatgccatta caatggaact catctaaatg 269640 
cccaggtcag gaaattctaa ggcaggaact 269700 
agaattcagg ttgtttttct actgtgtgct 269760 
taagaacaga atagcatatt agctgcaaca 269820 
gggaggcttc atacaagtct ccattttccc 269880 
tgttcatatt ttacatcctg atgggaaaat 269940 
aaacagaatc tgtttctttt cactttttat 270000 
tcacttgaag gccacaaggc cttgcttttg 270060 
tttacaaaaa aatctatatt ttgtcccatt 270120 
ctggcctcat cacagaaata ctaaggtatg 270180 
catttcttaa gttaattgtg tctttctcac 270240 
tttaggtgat ctttcagacg attagtgtag 270300 
ctaagtcact ctagctataa ttcccccttg 270360 
gagatggagt cttactctgt tgcctaggct 270420 
caacctccac ctcccgggtt caaacaattc 270480 
acatacacgt gccaccatac tcggctactt 270540 
accatgttgg tcaggctggt ctcaaactcc 270600 
ccccaaagtg ctgcgattac aggcatgagc 270660 
agaaaatgag cctgtttcca catggagggc 270720 
aaatactgaa agcaaacagg gagggaagca 270780 
aatattgtat acaatgtgtg tgtgtgcctg 270840 
agtaaggtat tgtatttata tattgctcag 270900 
tcactcatct tgtttgtact aagaaacaaa 270960 
ccaaattgta ttattgtggt ctcatttgag 271020 
tcacaaaagt tatatttgaa cacaaggtgg 271080 
gtgagccata tctagcgatt tagctttcca 271140 
gataggaaat aaatcaatcc catcttagcc 271200 
tccaataact gcataatcaa tatttatggg 271260 
cgctgcatgt gagaagggag ggcaatgtca 271320 
agcagagtaa atcagcagca aagtcaacaa 271380 
ctgctttata gaacacagct ttattctttg 271440 
gggaaaagtc taacgaattc tttagctttt 271500 
ttgtattttg tattttttat aaaaggcata 271560 
agaagttttg aatcttgttg gtgagtcaag 271620 
ttgaatccta cctgtatcac taatttgttg 271680 
caaaataaag caaaaaaaga ctacactggc 271740 
tggtactata aaatgtcaat gaatatattc 271800 
gaaggaaaag aagagtcaga tgtggtgaaa 271860 
tattaaaagc aaggataaga atttgctgaa 271920 
tagaaaaaaa atgaagagaa taggaaaaca 271980 
cacctgggat tgagaagatg acatcaactg 272040 
caggtcagtt tacagaacgc catggacact 272100 
aaaatttcaa tcaggaatcc tttcctatct 272160 
ttcatctgaa gcaggggggc ggggcgggga 272220 
gatatgaccc agtccgtaga atgtttgatt 272280 
ggtccacaac aaacagaaga tttacaatct 272340 
tttcacatgg caatagttgg cagaagctaa 272400 
ctccacattg ttgtgactga agctggcctg 2724 60 
gtttgcatca gagatcacta ctgtagggac 272520 
taccgaggtc tcatcctgag atgagaaggc 272580 
attgtatgtg agttttgttt tgaaagcaga 272640 
actagtctag gaggcaaagg agaggaacta 272700 
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gcttacacac gaggaagctt aaaaatattg tttgtgtata ttaaaaataa aataaatgcc 272760 
taggcatggt ggctcacacc tgtaatttca gcattgtggg aggccaaggc gggagaatcg 272820 
cttgaggcca ggagtttgag accaacgtgg gtaacatgga gagactccat ctctacaaaa 272880 
aatttaaaag ttagcccaat gtggtggtac ttacctataa ttctagccta ctagagagtc 272940 
taagatggga atcacttttt tcctggagtc taagactcca gtgagctgtg actctgccac 273000 
tgtactccag cctggatgac aaagtgaaac cctgtcttac tccaaggaag aaaatgaaaa 273060 
taatactttt gaaagtcctc agatagaaga ctcttacctg cttcatttta ttagtataat 273120 
atatgacaaa ttaaaaatat taaatttaag tgtttatttg ccacttactc aaaattttag 273180 
gtatacacag gttgacccag aaaagaaaca aatgctaaat ccaaaaatat tgtcgtgctt 273240 
catgcctaca ttacttttga gtacacatat ttcttctgta acccattata aggagttttc 273300 
aaaatataat atgttcttaa taacttacat taaataattc agtggtaaat acaagttata 273360 
atatattgaa tgttgactgg tcaagagttc ccttttttca taaaacagat ttcaattctc 273420 
atgttagttt ggccaactga aagtaaatat gcttggtaga aaacagtcta agtgagacat 273480 
tattaatgaa ataatatgtc tgaaggtctg gtactaatgc tggatcacga atataatttt 273540 
tgaacatgaa gagctactaa atgtaactga gtttagactg ggaaactttt ccagtgatcc 273600 
actaggtact agtttagttg catggctgaa tgacccagct atttgccaga ggagagggaa 273660 
atgatatgta gctgactgtc aaatcatatg ctatttatta tagttatatt ttgtttgttt 273720 
taactcttaa gtgaattgtg agtttctgca aatcagaaaa ttgtttatta atgtttgttt 273780 
gtactgctgt gcttaattca aaaccaccta atgagtgatt tttgaaatca acccaattca 273840 
tacatgatgg gtagatttaa aaaatagttt tacattgttt cttatttcgt ttagaactgc 273900 
aatcccgaga taggaagatt gcttcaaaaa ctataacaga tttctgaaat tgagtaaatt 273960 
tgcttcaatt gtctattaaa agttttgaag tataataaac aaaaaaatat attaagtaaa 274020 
gcaaggttca aaatcacgtt tccagtaaac tttgcatttg cagtacattt atgtagacaa 274080 
aacaagtcca caatagttta aacagacatt cacagcaggt gattccaaga ccatgggaag 274140 
gatttggttc ctgtttattt tctgagttgt cttggatgct ctccttccag ttatgtcttc 274200 
ttctctttct catgtatcat gtcttatttc aactattctg acagcatgtc aagcattagc 274260 
agcagaataa tctgtatgca cagtgggaga gagaggcaga ccaaatacac attgaataag 274320 
gaaaaagaag tttccagcgg ttgatacatt attctcccta gcaaagggct agatttgcta 274380 
agcatatgga gacatgctta aggatcaata gggagaactt tgaaatatat gcccatcctg 274440 
tgaaaattta tatcttgact gaggatatgt tttgatagta taagtaatat tctcataaaa 274500 
tttttggagt gacacatgag aaatggagga aaccataatg taaaaagcac atgttaagtc 274560 
cttgccaatg attaagccat ccttttgcct tataaagaca ttgattttca ttctatcctg 274620 
atcttttaga gtagttgggn ttctatatat cttgagaatt cctatgagaa taaaacacag 274680 
ctaaaattag tgtaatgagc ttaagttaga ggaaagcaag ataccctgga tcttggataa 274740 
atatacttgg tttaacgaat ttcttaagat atatccatga ggatatcatt ttccttacca 274800 
aaatataatg catatcttta tttatttgtt tttttaacct cagagcttct agggtagact 274860 
caaaaaagac attctgctag ccttatgttt tactatgttt ccataaatgc taacatttct 274920 
ttcaaatatt tacccccaag gcttatttta gatgtatcag ctaagggaga ggagtcagaa 274980 
agatacctta taatgaagtt tcctcttctt tttcttatat tcgtaacaat ccgataaatg 275040 
gtttctgaat tatcgagtag acctctgaga actcagcagg tgattccaaa tcaaattcac 275100 
aaccttccaa gatgacattt tccttagaaa atcccagtat ctaaccttac caactggatt 275160 
aaaatcaaac ttctttatga aaaaatatac atcattaaat tgtgaaatca aatatagctc 275220 
tattaattaa aatgtaatgg taggaatagg aaaaagtgtt agtgtgcgtt gtaatagaaa 275280 
gagatgacaa gtgcgcacag ctgaggaacg ttccttatat ttccaaggtc tctgtgtcag 275340 
aagttcgata tcaccataac cttttgatgg gtcttagtta ataaatagtt cgttattata 275400 
ttttattcat actgtaatat tgcttttaaa ttcgtcctct tcattcatgg ctgtctttaa 275460 
atcaatagaa tcagttaaaa tatacagaat ataaagtaga aaatacagcg tattttctgg 275520 
ttagaatatt atgttgattt aagacttaat tgcattctgt gaaatttagg tttaatcatt 275580 
ccacaaaaaa acacttttag aagcaaaagt gatgattaaa gtattcaaca aacataattt 275640 
aataatattg gtttagttga caaaataaaa tgttgttgtg aaagcctttc ttacccttgc 275700 
tctaaaaacg gaaaataagg ctgggcatgg tggctcactc ctgtaatccc agaactttgg 275760 
gaggttgagg tgggtggagg atcccttgag gtcaggagtt tgagaccagc gtggccaata 275820 
tggcgaaact ccatctctac taaaaataca aaaaaattag ccaggtgtag tggtgtgtgc 275880 
ctgtagttcc agctgctttg gaggctgagg catgagaatc acttgaaccc aggaggcaga 275940 
tgttgcagtg agccgagatt gtgccactgc actccagcct gggcaacagg gaaaataaga 276000 
acaaatgtaa tcacataatt tttttagttg ccatagtggg tttattacac aagcatgtgg 276060 
tgatttatat ttaatcataa tttattgatt atattaaaaa atagagacac aatggtctca 276120 
cagagaggga atgaacatat attattttgg ctaactatat tttatgagta actgcattta 276180 
catttccttg gttttgaact tagccatgct atgaacaact gaataatctt tattgttata 276240 
gtttcaaaat atttgaaatt atctaatggc tcttagatta tacaataaaa attataaaac 276300 
attatattga aaaaatgtat taatattaca aaaccttaca ataaaacata ataatacagt 276360 
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?2?JSJ? 9 Cat fS, CtC . ttttatataa aaatcagaag gagaaaatcc agttatttgt 276420 
tctttttttg agattttact aaattacaat gccctcaata atatgaagac atacaattta 276480 
ccaacaagtg gccgggcgct ctgtctcacg cctgtaatcc cagaactttg ggagccgagg 276540 
tgcgtggatc acgaggtaaa gaagtgaaga ccatcctggc caacatggtg aaaccttgtc 276600 
tctactaaaa atacaaaaaa ttagccgggc gtggtggcgg gcgcctgtag tcccagctac 276660 
tcgggaggct gaggcaggag aatcacttga acctgggagg cggaagttgc agtgagccga 276720 
gatcgcacca ctcactgcac tccagcctgg gagacagagc gagactccgt ttcaaaaaaa 276780 
aaaaaaaaaa aaattaccaa caagttgata atttcctaca gaattaacgt caccctgaag 276840 
tttacccctt tggtattaca gctgtttatc atgtacataa ctcccttttt attttgagag 276900 
gggttaaagt taattcgtat ctataattta tagagtaaaa ttatatttgt taacaaaata 276960 
aaaacttttt ataatacttt aatgaaaggc ttattcacca aagaaaataa taaaaattac 277020 
agtatctcat caataatttg gcagatatct caactcttat tttttctcgt attttaagac 277080 
tataaattat ataaaagtaa cagtattcat tgctaaacag taaaacaatt tttttctgtg 277140 
tttacatcta gcttactgtt ttagtaataa aaatgttctc aattttcatg cattgcagtc 277200 
acatccttaa aaaatctgag atttcttgat tttctgtaaa tttggcattt tagtctgaat 277260 
tattaaactt aagtaaccag gatgtaaact tttatcgctt ttttttcatt taaaattaag 277320 
aatgacatct ctctgagttt ttatgtccaa aattctggat acttctaaaa ggatgtgagc 277380 
. attactgcag ggtagatggt aatcctagac tttggaatac aaattaaatg agctgttcta 277440 
tctgaccatt ctctgtaaca cagtttcagg gacttgaacg acctgatttg aaaaacaagt 277500 
atgaacagtt agacataatg tgatctaaag gtagagtatt tatgtataat cttaagttac 277560 
aaaattattt gctgaaaact ttaaagtcat atcgtatctt aatatatata accttattta 277620 
' a !f" ta " attta( ? tc at ctgcctttct ttccaaaaat gctgaaaaat aagtcagtta 277680 
tcttttcctt catttgggga ccatgagaat tttctctttg agaaatatgt gttttttttt 277740 
ttttgttttg ttttgttttt ttttttttgn ggcagctctg tgagctcaaa tttcttaggt 277800 
ttaatctaag aaattccagg agcgttactc tcaaccatga aagctttttg ataggagtga 277860 
actgcagatg atctggcttt ggaacattac cagtaacttt caaagaattt tcttcatgaa 277920 
agtcagaatg tcagtgctgt tgctaaaatt tctggtttaa acatgtttct ttttcttttt 277980 
^ t1: ^^ C ^ C ca 9 ct 9 aaaa tacaaggcaa acactgccgt tcaaaaacat taaaaatcct 278040 
tgcttttatt ctcatattag ggttcaatat cagtggatgg aagtaaaata gtgtacacat 278100 
cactcaaaat aatcaatata ctgagagctg gtagccttac tttgagttct cacttctgct 278160 
tttatgtaag ccttgttgcc tatttggaat gtttttctac ctgatacagg aaatgagaag 278220 
gtgatattcc ttccagttgt aatcatccac gtttctatgg tgtaaagtca ctgtattcct 278280 
tttaccccca acttttacta cgcatcaact gcatgccagt tactggctga gttcttcaca 278340 
ttaacaatga cattaaaatt tcaaaatacg catttgagat agggataaaa atggcagaga 278400 
aatcaagcaa gttcccaaag gtcacagagc ccaataagaa acacaaacag aataacatag 278460 
gagtcagatc agtagccaag atttgatttg tgtgatcacc ataatataga tcttaaacca 278520 
aaactgatgg ctacttaact taattttgtg aaaagaatga acacagagag aaagatgggc 278580 
ctcatacaca cacacacgca cacgcacaca cacagcatat ttaactgcat tctactaaac 278640 
agatactctt tcaggtgaca attggctagc ctttttaaaa ttacttaatt aattattatt 278700 
tctttttaag cagagtcttt ctctgaaacc caggctggag tgcggtggcg ggatccggac 278760 
tcactgcaac ctccaccccc ggggttcaag cgattctcat acctcagcct cccaggtaac 278820 
tgggaataca ggcatgtgcc accatacgtg gctaattttt tcagtaaaga agaggtttca 278880 
ccatcttggc caggctggtt tcaaactcct ggcctcaagt aatccgcttg ccttggcctc 278940 
ccgaagtact gggattaaag ccttgagcca ccactccagg actggctggc ctttcttgat 279000 
agaacagtta agttttaaaa gcaaatgaga aatactctct ttttaaaact ttccctttgg 279060 
gatgaaattg tgaaattata gctaactacc attgactaga caaaactcca actgtgtttt 279120 
tggctttgca tttgttatat ttaatacttg actcaagaaa accttgtaaa gtgtcttttg 279180 
ttattacaga atctagaagc cctaagcaaa tatatcactc tttttctata tattaggaga 279240 
atccccattt gccctatctg tattttcaaa ctttggggct tcaaatgttt gaagtaaatt 279300 
aatatttgta ttttaatatt tacctagtca attttctaag aaattcaact tactaaatgc 279360 
teaatgtcaa tgtcattcaa acacaaagta tatatactac acttaattaa ggattcattt 279420 
acagagagtc atggaaaata taactcataa tttataattt acaagtctca acattttgca 279480 
tatttttcaa aagggaaata tatatacatg tatacataca tacacacaca tttagatatg 279540 
aggtactaaa ataaatataa tgtaacttac taataaaatc aatgtattgc atgaattgtc 279600 
aaaattctat atttttgcta tactaattca actcttttgt gccacattat gttagagtca 279660 
cttatttctt tatggagctg tcagagctag tttactcagt aaaatactct gtatgtttac 279720 
agcctttata tgatattata cttaaatcat atgcaatata attcatatta tgtgcaatgt 279780 
aatccaaayg tattctttga aacatttgta atgtgtattc agtaaaaata tactgaaata 279840 
aaaggtaggc acctggcaca atccatctct tagcccttct cacgttgtat tgaaatgatt 279900 
tctatccaag tttcattgac ctttagtcta agctcctatg gggaatttca taccatgctt 279960 
ctacaactaa aatttaaccc catactgata aataattggt ggtcaataaa tatttgtttc 280020 
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agctggctga agaagtaagc ctttccaatt aataataaaa attaaaattt tatgggattt 280080 
tccaaataat tagaatcaga ggaagagcga gagagagaga gagagactga gatctcaggc 280140 
atatcaacct tgtggataaa tagagggtta tagaaatgaa ttgacaaacg tgctgtaaat 280200 
tgcttagctc tgcgaatctg ctaattctca ggagagtaga aaagacttga catttatcaa 280260 
gtgtgtgtca tgtgaaacac ttcgtgcaag gtaattttca ctctttattc cttttagtac 280320 
caataggtat cagatatcac tgcacagtta gctgcaccta attatccagc taatttttct 280380 
acttatactt tgaaatacgt cactatatga caatggtaaa ctgtagcatt ttcaaatttt 280440 
accgtatgac ggttgtaaaa atgaagttta ctacttttaa acattccatt ttaatttata 280500 
tatatatatt tgttttctga tatttattat cgcatatgaa tgattttata tgtgattaat 280560 
ttatataaca aagtacacca ttgaacttca tgaatatgaa atcataatct ggtatacata 280620 
gacttttagt aatataacag cattttagga ggaattatat tatgagctga gtggcaagga 280680 
taacaaattc tgtcctgttt taattattta agtcacagtt ttactagtag tagaaacaat 280740 
tctgaaaaaa aaaaaagaaa aagaaaaaga aaaagaaaaa caaaagatct ggtcagatgt 280800 
gatggttcat gcctgcaatt ctaataccct gggaagctaa ggttggaggt tgtagaattg 280860 
ctgaagacca ggagttggac ctgggcgact tagtgagacc ccgtctctac aaaaaataaa 280920 
aataaatttt aatttgaagc aatggtctga gaaagtgaga aattgtcgac ttgtaccctg 280980 
gataatttta tgcctgattt tgatttttct aacatttctt actttaaaaa ttatccttga 281040 
aaatttcaga agtggcttca cacttaattt tcatgatcat tttcaatgtt tcttccaaaa 281100 
aatgatctac atggtaatta ttttatgtgg cataattctc tagttgagga gacacgatgc 281160 
ttttactccc caaataaaat atggatatat agctctttgt catatgagca catgttgcaa 281220 
gaatcatttg atatgtttat gctgaaatag aaccttaatg actcatttat atttccttac 281280 
atccaatttt attctgcatt acgaagtgtt taaataatat ggtataatat attgtttgag 281340 
tctagcctct aacatcctgc aaactggcat cagaataatt tacttttcct aagatttaat 281400 
agtcacaaaa atattcatta aaattaaaaa cattacatct ggaaatacaa ctttgctgtt 281460 
taaactgatc attgtactct actttccttt tttgtaatcc gaatggtaag gtataattta 281520 
caagttatat tttataagtt cataaatgag gagtaaattt tgttttataa aaacaaacaa 281580 
taattgttaa gttagtattt atcacagggg gaaaggcctt gattcaacaa tccctgccaa 281640 
gttgcaagtc tcaaatccta cttagatata tattacttgt ttgcgtactt ttaaatgtgc 281700 
cttccttcac tacagagtat atatgctgat ggctttattt tttagcactt acattcatag 281760 
attcctgcca tatgattgca ttttacagta taatacgaac atgttcataa agatttcatt 281820 
tgaatctatt cacagttttt atatatactc catgaaaatg ttatactttt aatatcaatt 281880 
atgrgatatt ctcaaattgt cattgattat tctggtccat tgttgagaga tgttcctcac 281940 
tcttagtaac agtttctaca tcctggaatt cataataata acaagagaca tttagatagt 282000 
gtgatggtta atagtgtcaa cttgattgaa gtatgcaaag tattgttcct gggtgtgtct 282060 
gtgagggtgt tgccaaagga gattaacatt tgcatcagtg ggctgggaaa ggcagaccca 282120 
cccttaatct gggtgggcac catctaatca gctgccagca cagccaggac attaagcagg 282180 
tcgaaaaaaa tgtgaaaagg ctagactggc ctagcctccc agcttacatc tttctcccgg 282240 
gctggatgct tcctgcactc gaacatcgga ctccaagttc ttcagctttg ggacttggac 282300 
tggcttcctt gctcctcagc ttgcagacaa tctaaagcgg gaccctgtga tcatgtgagt 282360 
taatattctc taataaactc tcctattatc tatctatcta tctatctatc tatctatcta 282420 
tctatctata ttctattact tctgtccctc tagagaaccc tgactaaaac acatagtaac 282480 
ccacagtatt atttacccac atatagtaac cattagctta tttaagccat accacaaacc 282540 
tatgatgcag ttgttatttt ccctcatata aacgagaaat ctgaggatga cagaagttaa 282600 
gccatttgct tgagagtttt tagctaataa gtgaccagga cagaaattaa cttatctcta 282660 
taaaatgtta tttttgttaa tattgaatct cagctatagt aaacaggata ttttatgtac 282720 
agactgggat atatgtgaaa tgatttaccc ttatttatat tccaggaaag aagaaaaatg 282780 
attacgttct tctcaggttg ttacttcata aacggtataa ctttttattt tgggatatta 282840 
tgacaagaat atttgggctt tggctgaagt ggaaggactt gggactacat gtcaaatttt 282900 
attttgatag taaatattgt gtccaagggt gcccatcctt taaagtatgt aaattcaggg 282960 
ttacccctct gtgattatag aagacttata tcctgtactt gataatggga aattaactaa 283020 
atgattttcc atataatgac cattagcttc tcaaaggaac taaaaaagtg ttaatgagaa 283080 
aacacacatt tttattgact acaaaccata atttttgatg gtctcataat caatttaaat 283140 
atagaggcat atttataaat atcatttgca tttgttgttg gatttttatt tgatatatta 283200 
tttattttaa tttttttatt tcaatagctt ttgggataca ggtgtttgtt ggttacatgc 283260 
atgaattgta tagtagtgaa gtctgagatt ttaacacacc agtcactcaa atactgtaca 283320 
ttgtatccaa tatgtcattt tttatccctc accaaacctc cctcttctga gtctccaaag 283380 
tccattatat tactctacat tttgcatacc cagagttccc acttacaagt gacaacatac 283440 
agtatttaat tttccattcc tgagttacct cacttagaat aatggcctcc agctccgtcc 283500 
aagttgctgc aaaagacatt atgtttatgg ctgattagta ttccatggtg tatctatacc 283560 
acattttctt ttcttttctt tttttttttt tttttgagac agagtctcac tctgtcaccc 283620 
aggctggagt gcactggcac aatctcagct cactgcaacc tctgcccccc gaggtcaaga 283680 
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gattctcctg 
ctaatttttg 
ctcctaacct 
taagccaccg 
ttagaacggt 
gtctttttaa 
attgaatagt 
ttgtactaat 
gaaacggcta 
ctcattgttg 
ttcttttttt 
tgctgatttg 
ttgcaaatat 
tgtgcagcag 
tgcttttgag 
taggttaact 
taatggatgt 
ttttcccagc 
ttgtcaaaat 
ctttggtgta 
agtatagttt 
ctttttttcg 
aaaatgatat 
tggtcatttt 
ttgtttcatc 
cctccttggt 
gttgtgaaag 
aatgctaatg 
aaatctagga 
tctaagccta 
cttggaaatt 
ttttattttc 
ccccaaaccc 
aacaaacaca 
caatatttta 
aagggactcc 
aggacgcttg 
gatttgcaca 
actgtcaaaa 
tcagttgatt 
caaatggctc 
ctcagctatc 
tatgttctaa 
taaaaaagca 
attaacaata 
ggatgatggg 
tgtcaaacag 
aataaataag 
tgggaaagag 
cacaatagaa 
gatactggtc 
ctccaaaaag 
aacaagcatg 
ccttaatttc 
gattctctta 
aaaaagtata 
ttacataggt 
ttaagcccag 
agaccccagt 
tataagtgac 
ggcttccagt 



tctcagcctc 
tatttttagt 
cgtgatccac 
cgcccagcct 
tccatatttg 
tataatgact 
agatcccctt 
gtacattccc 
ttgtttgttg 
tcttaatttg 
gagaaatgtc 
tttgagttat 
tttctcttat 
ttttttagtt 
gtcttattca 
tctagaaatt 
atgtataagt 
accatttatt 
taagtcagtt 
tgtgtctact 
caagtccagt 
ggctcttatt 
tggtattttg 
catgatattc 
tacaattttg 
taactatatt 
gggttgagtt 
atttgtgtac 
gtctttcttt 
cccttacatc 
ccctactctg 
cagatggaat 
ttctgcagta 
ctctttattt 
ttttaaagca 
ccccacctcc 
gtcagtaaag 
cgtgaagcca 
tcaatgaaaa 
gatataatat 
agtgaattgt 
tacaatttgg 
tgctcagcaa 
aaactttttg 
aactaaaaca 
tagatcacaa 
catccctgta 
ctgcacgtgt 
actcagggca 
aagacatttc 
atcaaagcaa 
ccattt.tctc 
ggccatgtat 
agggaacata 
ggatcctgat 
ttatatttta 
aagcatttgc 
catgcattag 
gtgttgtttc 
aacatgtagt 
tccatccatg 



ccaagtagct 
agagacaggg 
ctccctccct 
ataccacatt 
caattgtgaa 
tctttttatt 
ttatttcttg 
accagcaatg 
actttttaat 
cattttcctg 
tgttcatgtc 
ttgtagattc 
tctggagatc 
taatgtgggc 
ttaattcttt 
tatagtttca 
tgagagatcc 
aaacagtggg 
gtaagtattt 
gtatactggt 
aacgtgatgg 
ttggttccat 
atagcaattg 
attcttccaa 
ttcagcaata 
accaggcatt 
cttaatttga 
aatgattttg 
atgatgtaaa 
ccagttacgt 
acccagatag 
aaatgaagac 
gtcttttcac 
gttcacctaa 
tactggaaag 
accaacacac 
gaatttctca 
ggcttcacat 
cttcacaata 
tagataagtg 
ctaacattcc 
ttggcagggg 
actgtgaaat 
gtatgttaat 
attctacctg 
aatcaaagaa 
aggaacttag 
gttttgcttc 
aaaaaagagt 
ttttcaaaga 
gaccttatac 
attcccctaa 
ttaagaaaac 
aattttgcca 
gccttgtgtc 
ttttaagttc 
catggtggtt 
ctatttatcc 
cctccctgtg 
atttggtttt 
tccctgcaaa 



gggactacag 
tttcaccgtc 
cggcctccca 
ttctttatct 
ttgtgctgca 
tgggtagaca 
actaaatctc 
tttaagcatt 
gattgcattt 
ataactcata 
atttgcacac 
tagatattaa 
gtctatttgt 
caattaattt 
gcctaggcct 
ggtcttagat 
agtttctttc 
tactttcccc 
ggatttattc 
accacgttgt 
ctccagattt 
atgaatttta 
cattcaattt 
cccatgagca 
ttttgtagtt 
ttattttttt 
ttctcagctt 
taaattgaga 
agagaagcaa 
tgctaactca 
tcttttgatt 
caaaagaatg 
tgctgaaatc 
cctgcatgcc 
aacagaagca 
acaggtatcc 
tgatccctcc 
gactgtattc 
taaagaatat 
caggatggta 
atggcaaatt 
aactaggttt 
tttccatgaa 
tattgcacca 
cttgattgaa 
acagctacat 
tctacagact 
cttgtattct 
atattgattg 
aactgagtat 
tttgtcaaag 
atcattctca 
agaatttacg 
aagatgatat 
ttagttatgt 
tgtgatacat 
tgccacactt 
tgatgctctc 
tccatatgtt 
ctgtttttgt 
ggacatgata 



gcacctgcca 
ttgcccaggc 
aagtgttgag 
actcactggt 
ataaacatac 
cccagtagtg 
tacactgttt 
gccttttcac 
ctgcaggagt 
tgtttgttag 
ttttgatggt 
tattttgtct 
tctgataact 
attttgttta 
atgtccagaa 
ttcagtcttt 
ttctacatgt 
aatttatgat 
ctgggttctc 
tttggaaact 
gttcttattg 
cgtttttaaa 
gtagattgtt 
tgggatgtga 
ctccttgtat 
atttttgtat 
ggttgttgtt 
attcactaaa 
aggccaatgg 
gaaaaatcag 
aaaatcatat 
tttatgtgga 
gttgatatac 
tttattcctt 
ttaaagaaaa 
tctttctcaa 
tcctttctct 
cataatagca 
agctagaatc 
acttttataa 
tgtatgaaga 
gtgagtttgt 
atataatagt 
acttgtgttg 
gcaaaaatgt 
agtctgactc 
cacataggcc 
ttgatgtcag 
acagcttcct 
aattgtcaaa 
tttactttaa 
gtatatctca 
ggaggacaag 
tgccatacta 
tttttcttaa 
gtgcaggaca 
atcaatccat 
cctccccaca 
tttattgttc 
gttagtttgc 
tcattccttt 



cctcacctgg 
tggtcttgaa 
attacaggca 
ggatggggac 
acctgtagat 
ggattgatgg 
cccatcgaga 
catattcaca 
gagatggtat 
ccatttgtat 
ttttttttct 
gatgcacagt 
atttctttgc 
cgtttgcatt 
gagttttttc 
gatccatctt 
ggctagccag 
tttgtacgtt 
tatactgttc 
atagccttgt 
catagaattg 
aagtattgta 
ttggtcagta 
ttccatttgt 
agatctttca 
gtttgcagct 
ggtgcatagc 
tttggttatc 
agtgtcaccc 
ctgaagggca 
ggttattagt 
aactgtgttt 
taccattata 
agaagatttt 
aaaataaata 
ttcatcagtg 
acagggcata 
caccaaaaac 
atggtaatga 
actaactttt 
ctacctggaa 
ttcccccctc 
tacgtaaata 
tttgactact 
caatgtgtaa 
agacaggaca 
ctgcctcagg 
ggtcagatac 
caagacagcc 
aaagaaaata 
atgtcacctc 
gctttcatta 
acagaccaga 
ctatttctgt 
tcagaagttt 
tgaaggtttg 
cacctaggta 
accccgaaat 
tgctcccact 
tgaggataat 
ttatgggtgc 



283740 
283800 
283860 
283920 
283980 
284040 
284100 
284160 
284220 
284280 
284340 
284400 
284460 
284520 
284580 
284640 
284700 
284760 
284820 
284880 
284940 
285000 
285060 
285120 
285180 
285240 
285300 
285360 
285420 
285480 
285540 
285600 
285660 
285720 
285780 
285840 
285900 
285960 
286020 
286080 
286140 
286200 
286260 
286320 
286380 
286440 
286500 
286560 
286620 
286680 
286740 
286800 
286860 
286920 
286980 
287040 
287100 
287160 
287220 
287280 
287340 
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atagtattcc atgatgtata tgtaccacat tttctttatc cagtctatca ttgatgggta 287400 
tctgagttga ttccatgtat ttgctatggt gaatagtgct gcaatgaaca tatgtgtgca 287460 
tatcttaaac aaatttgtaa gaaaaaaaca aacaacccca tcaaaaagta ggcaaaggac 287520 
atgaacagat acttctcaga agaagacatt catgcagcca acaaacaaga aaaaaatctc 287580 
aatatcactg gttattagag aaaagcaaat ctaaaccaca agataccact tcacaccagt 287640 
cagaatggtg attattaaaa agtcaagaaa caacagatgc tggcgaggct gtggagaaat 287700 
aggaacactt ttacactgtt ggtggcaata tcaattagtt caaccattgt ggaaggtagt 287760 
ggggcaattc ctcaaagaca tagaaccaga aataccattt tactcagcaa tcacattact 287820 
gggtatatgt ccaaaggaat ataaatcatt ctgttatcag gaaattttaa agcttctttg 287880 
tgatgtgttt tattagttaa aacaataatt tattagttaa aatgttgtgt ctttcctaac 287940 
acttaccaca tggtgtatca tgcaatgcac actaatgaac gcttcaaaat tagccttaaa 288000 
tgcacagtct cctcacttgc actctttgct tatcttgtgt ctgatggctc ttctgcatga 288060 
acaggaaaat ctgaattctc acaattattt ctaaagcttt tattaccttc ccttaaccca 288120 
agcccagttt tattacctgt atgacactta gacagtttac attgtaatcc caccctctaa 288180 
tggatttaaa ctcattttcc actttggtat ccccatattt agtcattatt tgtacttctg 288240 
ttattgcaga cccactccac cttgtgttgc agtgtgtgta tctgtgatcc cctgtagagt 288300 
gtgagttcct agagtcagag tatccaggga tagctctctt taaacatcta atgggattct 288360 
atgaacataa tgaaataaca tttatgttag ctaaatgtat aaagagaaaa tatatacttg 288420 
cccctttatt aaagttgttg tttcatatta ttatatgcca acaccatact agtcctattg 288480 
acagagagtt atatgtggta tttgtattac ctttaaagat attagagcat aaaataattg 288540 
gaataaactg ttaggacaaa ataaagaggt acaatacaga cactgaaatg ggattggcca 288600 
attctactta gctggacaga aaagctttct agaatagatg aatctcaaag gataaacgga 288660 
ggcataatcc ttgcagacaa gaagaaaagt gctttccagg tgaaagaagc agtggtacaa 288720 
cgaacaaata tacaaaggat gaggaaggtg gagcatccaa tttaaggatc aaaataacaa 288780 
ataattaaat taggttattt agaaggtaag atcatagata aacacacaca tgcatgtgtg 288840 
taatggaatt taaaatcatt ttttaggtaa gaaagattcg ttggaatgtt ttagagggaa 288900 
gaattaagta attctataca cattttagaa agctcattct gatatcctat aaagaatgtg 288960 
tcaagtgtga aaattagtaa gatgaagagt cagtaatagt gaatctaaaa atgtgaatga 289020 
gcacaagaga ttattacata tatatgtgtg tgtgtgtgtg tgtgtgtata catatatata 289080 
tatttttaat ccaaagcata tggtaactga taacaggcgg gagtattaga aagggtcatc 289140 
caggttgaat ccaggatttc atgggagata gaaggaatga acagtgtttc cagacactga 289200 
gattaagaat gtgagaaaag aagcagttgt aagtgtgttg aacttttgac gtctatacaa 289260 
ggagtctact ggttactaga aggtttacaa agagacctag gtggatatta ctagtctttg 289320 
tgacattata gtgtactggt ataactcttg gtgctatttg gggttgctaa aaatgaaata 289380 
aggaagggaa ggggacagaa gataaaaatg gagggaggat aaaggggaga acatgagagt 289440 
ggcaagttaa aagtagggca ctagagcaga agacagaagg aaagcgaggc atacaaggta 289500 
agggaaaggt aagaaggagg cgagaagaga acctgtgtga tttccaacat ctaagtgtgg 289560 
acaggagaga ttgtctgtaa aagggagaag acatggtcac gaggaaaggg ggtgctagca 289620 
aattttcatc atggaagcct aaggtaaaat tttcagagaa ggatgtttaa tgtaagtgat 289680 
atctaaaatg gccaagatca atcatgatga taagtgaaca ttcagataaa gtttcagaac 289740 
agataaaact ctgtaatggt gacagcagcc atcggagtgg ttacatggta tggcctgggg 289800 
agggatgtgg ggaatcctgg ggtgttgaac atggcctatg tctcaaagtg gtgaccaatc 289860 
acattaccgc acttcatgtg ctttgctata agcaggtcat ccctcgattc aaaatgtgtt 289920 
aaaatgtaat acgaaattaa tataacacct aagagggtaa aaaatattag tggctttctc 289980 
ggattacctg ggtaccatca gctgaaggat tcaaaccaaa ctctgttaga ctcagatgcc 290040 
ttagctgctg accgctgtac tactctgctg tcaaatccaa tccttgccaa gcctctccat 290100 
tggcatgtta aatggactta gtctctgttg tcccttttag tcttattttg aaggcaccta 290160 
cctattctac caactgataa attgaattgt tattactgtt ttacagtaag atttggtcag 290220 
aagctttggt tagaccataa ttttttgtca gatctgttgt tctgaactct ccattaagga 290280 
actgtttaga tattacatat acttttgctg aaattcagag gtactcataa atttgacgac 290340 
ttttaaaaaa atgtactatc agcagtggta aatgataact aaaatggtat aatttatctt 290400 
aacagtatct tgagcttaac aaagcttcag ataaaaacaa gattgcattg aatatcatgc 290460 
aataaaagga aatacttaaa gcaaaggtat ttatatgttc ttatgtgtat gttcttacag 290520 
ggatggaaaa gtctattttc ataaaatcta attttttaat ttacaatata gagccctttg 290580 
cttcttttcc tgtttaggtt aagatctttt cttctcttga aagattatgt atattttaat 290640 
atgattaatc ttattttacc tttacagaag tactgtgaat tagcagggca gatattatga 290700 
catctcctta acatatgaat taattgtggg ctgccaaata gtacatgatc ttttactgga 290760 
acgtagatcc aggtggctca aatagatatt taagggcaga gcacctaaac tgaccagcca 290820 
atgtaagcag ttcatcttcc gagcagattt atttttcact gtaaccaaat gtatttacat 290880 
gctcctggaa aaagtgttct cccgtgagct ttgttagtgt gttttcacca aggaactgct 290940 
atctgggttg cccacactgt gtagagtttt tgtcctgaaa agctaaagtc catatattga 291000 
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agaaaaccta ccacataaag atgtttccta gaatacccgt catgatcaaa ggtttgcagc 291060 
tgtagtggaa ctaactaatg gatgcttcat tttagtaagg gcctcttttc ttccattgat 291120 
catagttgaa tattgtggtg tcagtttaag atgagtccat gtgtgacatg gatttaaata 291180 
cccaaatttc agtggtttga taaaaattat aagaaatact gcaattatct caaactaaaa 291240 
cttgtcaaga tttgtgtaat atcatagaat gtggacaaag tgattaaaaa atatattcta 291300 
ttgactctgt gacataattt tagcagatac atctttatat tggccaacct ttctttgaat 291360 
taatcttgac ttacgagttt agaatagtgt catgtctgga gagaaaaaga cttcatgtga 291420 
aatataaaca ctctatgtac tattttagta ttaattttag aaagatattt ggcacacctt 291480 
gaaacgtgat ctaatatcac tttcaattct aagtctattg catcatcaag cagaattata 291540 
taaactattt ttaaaagttt atcaactgga ttttatgtta ttagcaatac atgagatgtc 291600 
atggctttaa aaggatggct attttattat gtagtatcct tcaaatatta ttttagtagt 291660 
tttttgtact aaccataggc caattaatga ttcagaaata tatttaattt ctattttata 291720 
aatcttatat tcatggcaat atactacccc tcactgattt ttgcttggct ttattaaact 291780 
gagtttctgt tttatctctg caaaatataa taaaaaataa aaatatgctc aaatattaag 291840 
caccccataa atattaggtg gataaataaa tataaccaaa ctgctcttgc aaaagtgaaa 291900 
aagttttaaa tatagagatt tcattccatt ccctttaatt tttcaaaata ttagatgata 291960 
caaaaatcat gtgcacaatt cagactcata gctattttta gaaactataa tttttgattc 292020 
aaaagaaatc tgcaatcttg caaaaatgag actcagttta acagtgtatt tccaaagaaa 292080 
ttgtagctgt tattcttgcc tttcttctta gacaagagac ttgtgatggt atctcttaat 292140 
attccatacg taacaagaaa aagagcagaa aaagccatgg gaactataaa tataggcagt 292200 
gatattgtgg tattgtggtt gaaaacatta tctggaagtg agaccacaac aggttctcgt 292260 
tctagctctc catctaactt ccatctctag agatgtgccc ttgacaaaat gattttttcc 292320 
tctggacttt aatttcttta tctggaccag gtaagctcaa agatctcagt ttagtatttg 292380 
atttttccag attaaaatct aagatgtggt atcagataca cttttctatt tttcttttat 292440 
ttgtcagaca aacaaggtga aaatgtgttt aaaaactagg tttatccatt tttttcctgt 292500 
acagtgttcc taattcaaca ctacattttt ctatatactt ttagtagaaa gtgaatcata 292560 
agttacttca gtaggatata tctgttgttt tgctcttgta ctcttccttt actattacta 292620 
ttatttcata acataaataa tgttaagaga tctgggtcta aatgtaggat ttaatttttc 292680 
aagtttttac aaattctatg aattccctac agtttctttg gccgtcacta tttgtctggt 292740 
tgtttataga agccgagaag cagaacccct atggctgaag actggtcctg atagagattc 292800 
ttcttctaat cttctcaaac caggaaggtg ttcaggtcac tctgtaaatg attcaaagac 292860 
agatactatg taaagttaca tttttccttt aaaatccagt ctgtaagcaa attttatgcc 292920 
ttattttaaa taacaagata tgtagctttg gcaacatgtc atcttatttg gataggctat 292980 
gccaacagga cccaaccaga gcaagccaaa aaagatcttt gaatgacaga cagcttaaga 293040 
agaacagaat ctgttagaga aaataagaga aaaggagaaa tgtactgagc agaccaaaaa 293100 
aataaaataa aatagaataa aaggagcttc cacacaaggc tgttgggttc attcaccaca 293160 
taccaaggga acactaccaa gtaaatatta taaataaaat acttgctgga gaaactgaat 293220 
atccacctgt aaaataatga atcagaacac ctacttcact tcatgccata tacaaaaact 293280 
aacccaaaat agatcaaaga tctaaattta ataagtataa atataaaact cctagaaaga 293340 
aacataggtg aaaatatgca aaagttttta ttagggaatg agttcttata tatgatgcca 293400 
aaagcacaag caaccaaagg aaaatcgata aattagattt tatcaaaatt aaaaactttt 293460 
atgcattaaa ggatactatc aagaaaatga aaagcaagta acagaatagg agaaaatatt 293520 
tgcaaacata tgtctgacaa gtattaagta tctagaatgc ttcaagaact cttacaaatc 293580 
agcagtaaaa acacaattca attaaaagat gagcaaagtt caatattaag ctattcccca 293640 
gagaagatac acaaatggcc aataagcagt gaaaagatac tagcattagt cattagagaa 293700 
tgcaaatcaa acctacaatg aaatagcact ttgcacctac taggtggcta taattaaaaa 293760 
ataagcaaac agaaacaaaa caactattgg tgtggatgta gagtaatcgg aatcctcata 293820 
cattgctagt aggaatgaaa catggtacag ccactatgaa aaaagaaatg ttttcagttc 293880 
ctcaacaagt taaacataga attaccatat gacccagcaa ttccgcagct aggtaaatgc 293940 
ctgagagaag tgtgttagtc cattttcaca ctgctgataa acacatcccc ggggctgggt 294000 
aatttataac ataaaagagg tttaatggac tcccagttcc acgtggctga ggagggcctc 294060 
actcacaatc acagcagaag gtgaaagaca tgtcttacat ggcagccagc aagagaggat 294120 
gagagccaag tgaaaggggc tttgccctta taaaaccatt gagatctcat cagacttatt 294180 
cactaccatg agaacagtat ggagaaaacc acccccatga ttcaattatc tcccaccatg 294240 
tcctttccac atgtgggaag tatggtaacc acaattcaag atgaaatttg ggtggggaca 294300 
caacaaagcc atatctgaag taagacaaat tatctcacaa aactatacac caatattcct 294360 
agcagcatat tcattatagg taaaagaaga aaaaaggcac atgtccatta tattcataca 294420 
atagaatatt attccatcat aaaaaggaat gcggtagtgt tacatactac atgatgaacc 294480 
ttgaaaatat tatgctaagt gaaacaagtc tgacacgaaa gaccacataa tgtatgattc 294540 
catacagaag aaatgttcag aacaagtaaa ggcacagagg cataaaatag attagtggtt 294600 
gccagtggct ggcgggagat ttccattaac agctaacatg tacagggttt ctttttgggt 294660 
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tatcagaaaa 
aacaaacaaa 
tgtcagtaaa 
tacacaagta 
tatagtcagt 
aactacttgt 
tgcttcttct 
tcttgaaaat 
ttcactttat 
attttaatat 
gtaaaattat 
atttttacat 
gaaggaaatt 
gaaaattgtg 
gatagatgat 
gttgcaatgt 
tgattggtcg 
tgcattggtc 
acattttatg 
gtgtggaagc 
ttttcacttg 
gggtcacttt 
atggtttgtc 
ctctcactcc 
agaagtgctg 
cataatgaac 
ttggttgttt 
agctcctcga 
atgcaatgtt 
atggagtctc 
gctccgcctc 
tgggactaca 
gtttcaccat 
cctctcaaag 
gttttggctc 
cataactgga 
tccttttctt 
gtttcttatg 
cttccgaatc 
acatattttg 
aggagacttg 
atcggtagct 
gaataggtgt 
agaacacact 
accctccact 
atccaatgag 
aaagactaca 
aacgctatgc 
cctaactttt 
agaattaagt 
acctaaaact 
acatatatac 
aaatatccaa 
atatatattt 
cactatggac 
cctctttttt 
ttaaagttgg 
catattctat 
gcactattca 
attaagaaaa 
ttcatgtcct 



ttctggattt 
acactaacat 
aatgctaaag 
tgcatttagc 
taagattgaa 
gctgggcact 
agaaaagatc 
tattgaggac 
tagatattaa 
aaataatgca 
ttcatatttt 
ctgctccagc 
ttgccttgca 
aatatatatg 
agatagataa 
gtaatccgga 
ttctttcact 
atttggaaaa 
atacaaataa 
cgtaagctgc 
aaagctagaa 
gttcatgttt 
agtcattctc 
aacgaccgca 
gatttcatct 
acttttcgct 
gtttacactg 
ttcttcatct 
agcacagtga 
tctctggcgc 
ccgggttccc 
ggcgcccgcc 
gttggacagg 
tgttgggatt 
cagagtcccc 
taactgctgc 
gttttcttac 
gttattttat 
ttctctaaca 
aggagttttt 
tgaaccccag 
gaaatgcctt 
tgtggcctct 
gctttctaag 
gtccttgtta 
tgacctgttt 
attgcatttt 
ttttgtgaca 
gtaatggaaa 
ccattgttct 
gaatgtaaga 
actggggaaa 
atatttctta 
ttatatgtat 
agaagcagta 
cttctgatcc 
ctatggacag 
attataaatc 
caatagcaaa 
tgtgtcacat 
ttgtagggac 



catatttatt 
atactcttgt 
taaaacaacc 
ttttataaac 
tacatgttta 
gtcctaggtt 
aatgatttga 
cccaaagaac 
actgagaaaa 
tttattaaaa 
tgcaaatatt 
atttagtcta 
cggatatgga 
tatatatatg 
ctacatcagc 
actgtatcag 
ttgaagagat 
ggtttgttca 
cacttgttaa 
caagctaatg 
ttttataatg 
gaaaaagaaa 
tcaattagaa 
tgagcgcccc 
gaggatgtga 
gcttcatcaa 
ggtgtgtagc 
gcactttcgc 
aaaagacaaa 
caaggctgga 
gccattctcc 
gccacgcccg 
atggtctgca 
acaggcgtaa 
ctaaaagtgt 
tctagattct 
acatgggaga 
atggcttcat 
tgccctcttc 
attatgttca 
agtcagtggc 
agtcttgctt 
tagtaactga 
tgagaagagc 
ggttatagtg 
tgggaagcaa 
agaatataga 
gttaatggaa 
ggattggatt 
taacctgtta 
ttttgtgagt 
gaatatcaaa 
tatacacagg 
gttttggtca 
ccatagtttt 
atctccttcc 
caaagacaag 
atgctgctat 
gacttggaac 
atacaccatg 
atggatgaaa 



gttgcagagc 
aactggtgaa 
tgaagagttg 
aatgtcttgt 
attcctgcct 
caaataattt 
aaacattttt 
tattttttat 
aattttaagc 
tattttctat 
ttgacatctg 
ttgcaggctc 
aaagagagga 
tgtgtgtata 
actaaacaag 
tgaaattctt 
cttttaatca 
tgaagctttg 
taccaccact 
gtggcacaca 
aataacaagt 
agtctgccaa 
gcgaagagct 
ctgccgtcaa 
aaaaggcatg 
ggacgtgctt 
agtgaagact 
cattgatgcg 
tacatttcgt 
gtgcagtggc 
tgcttcagcc 
gctaattttt 
tctctttacc 
acgtttctta 
attgggacac 
aaaactttct 
ttcttcctca 
gatgcacaca 
ttgcctatag 
ctctgggaga 
agtgcctgca 
cagttatttc 
gtgatgggag 
ccagcttcct 
cttatcaatt 
acatgttaaa 
ctagtgctac 
tttaaaaagg 
ggaatggaat 
ttcttcagtg 
tttttttttt 
aggtacattt 
tataggtatt 
tttcacttgg 
ttcctcttat 
cctccctcaa 
tgaaatgtaa 
aaagacacct 
caatccaaat 
gaatactatg 
ttggaagtca 



atagtgaata 
ttttgcttat 
actatatata 
taaattatta 
ttattgaata 
aacaataatt 
atttcaggac 
ataggataca 
atgtgttgaa 
tagaaacatt 
actgaagaga 
ttgttttggt 
gtatttcaat 
tatatgtgta 
tggtaatttc 
ataacctgtt 
aaagtgattt 
cagattttcc 
gttctcaaaa 
cacgttttcc 
actgtcagat 
atgcctactt 
accaaaaagt 
ctgtcacgct 
tactcaggat 
aactaaaagc 
tcagtggcac 
ccactagttt 
tttgttttgt 
gtgatctcgg 
tccctcccga 
gtatttttag 
tcgtgatccg 
atattatggc 
ccagagacct 
ctgagttgct 
ttgtctctga 
cacatcctta 
atcatgacag 
gctttcgctc 
ctccctagct 
ctttaaatag 
aggtcatttc 
gacagagcac 
tctctccttt 
agaaacccta 
caattgtgat 
aaaagtaaac 
cctaatacat 
acctcgcagg 
tttccctgca 
gttttattca 
tagttattgg 
taaaatagag 
atcaactttt 
ttccagtttg 
tttaaaatac 
gcacacttat 
gtccaacaat 
cagccataaa 
tcattctcag 



tactaaaaca 
gtgaaaaata 
tttgtagata 
ctagcaattt 
ttcattctgc 
aagagagaca 
acttttatac 
tccatcaata 
ataataaact 
ttagagaaga 
agattttcag 
taaaatgtat 
taccttttca 
cagatagata 
ttacatgttt 
acattaaaat 
tgcagaattt 
aattgttgat 
ggagttttaa 
gaaattctgc 
gtttccctta 
atgaattacc 
gggcagcgaa 
gccgtgcagc 
ggaaattcca 
agcatcgttt 
ctagtgcggt 
tgctcccctt 
tttttttgag 
ctcaccacaa 
gtagtgtagc 
tagagacagg 
cccaattcgg 
aataaatata 
gcctgagcac 
ttatcaattg 
aaatcctcat 
tatatatttt 
tattctacgt 
ctcagggtac 
tgctgctcac 
atgctaggtc 
tttgcttata 
cattagaaga 
acagggccca 
cttgaaagtt 
tgtcctcctg 
gtatctataa 
tatttctctc 
actgcaagct 
tgatatatag 
ttcaaatatt 
atggtaaaat 
catcagtgtg 
cttctcattc 
tttttctctt 
gcacatttaa 
gtttattgcg 
gatagactgg 
aaatgatgag 
taaactatcg 



294720 
294780 
294840 
294900 
294960 
295020 
295080 
295140 
295200 
295260 
295320 
295380 
295440 
295500 
295560 
295620 
295680 
295740 
295800 
295860 
295920 
295980 
296040 
296100 
296160 
296220 
296280 
296340 
296400 
296460 
296520 
296580 
296640 
296700 
296760 
296820 
296880 
296940 
297000 
297060 
297120 
297180 
297240 
297300 
297360 
297420 
297480 
297540 
297600 
297660 
297720 
297780 
297840 
297900 
297960 
298020 
298080 
298140 
298200 
298260 
298320 
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caaggagaaa aaaacaaaca ccgcatgttc 
acacatggac acaggaaggg gaacatcaca 
ggggagggat agcattaggc gatataccta 
caccagcatg gcacatgtat acatatgtaa 
aaactttaag tataataata aaaaaaaatt 
ataattagcc tatgttacta ttcattttcc 
aactgtgact gaaatgtatg agtttctcaa 
acaaaaataa attgaattgg ttaatataca 
attttttaga gctattccac attattcaac 
tttttaattt ttgaattata atagaagcag 
tgtaaataca ctcatttttc attttaaaca 
taccaaagat attaggggaa aaaataaact 
aattcatcct gctaaccctt tagggtattg 
ttatctagaa aactgagaga acaactaatt 
ggcttttgtt caacacaatt tttaagtctg 
ttgccatggg gtgtgtgtgt gcacatatga 
gtgtcatgga aagcgtggcc tggggatttg 
tggtttcttt gcaacattat tggagggagt 
atttttggta tgatagacat ggtaattact 
aaaggggata ggagtaaaga attaaagtga 
agagtgattt cttgaataat cccttcatct 
aattatacaa gcaagcctag acacacacta 
tagctgcatt aaataccaag agaagccttg 
cagttgtttc cagagaagag gcaagttata 
gtagaacacc atgcaataag gcgctattga 
acaaagacgg gccatcgaga tattagagga 
atgtttcctc aatcttattt cctttctccc 
attgcgtttg tacagattac aggaaaacac 
cttctgcagt tggtaataaa tgagatctac 
ctttcagatg actctcatgc caccaggtca 
cacttgggtg atcaccaaga atgtgggact 
gtgtcctttg atgttactgg aggtgtaaaa 
ttaatggtat agtaagtatt tattatcttt 
tttttaaaaa tgtcaaattt taattttaat 
atatrgtttt acaattcagt ggaattctct 
atattttaga tgttttcaat catgagttat 
tcactatgag ataaagggac gaaatgagtg 
tttccatggt tacattgatt cactagcagc 
accccttgcg ggcagggtct agtatatttt 
tccaattttc ttcctacatt gtagtactag 
atgtatgcta cttaggcagc tctggcatga 
acattccagc attccaaaat tctcaaagca 
caaaatttga aacacgaaaa ctgacttttt 
tgtgctcttt gcacgtgcat aaacaataag 
taaagtaaac atacactatt taaaatactg 
ataaaatctg aatgatgaaa gtacttaaaa 
ttgtggcagt cttcatatga ataatttaac 
atgaataacc catttgtata ggagtaaact 
agctcataca acgtacaagt agaaaagtga 
atgatttcca ttatttttaa ataagagtaa 
agaatgataa aaatggcaga atacggagcc 
caaccaatat ttatattaat agcttatttt 
cattatttct atctatatct ctatatagat 
acagtgataa acctttctct taatatgaat 
agctatattg aattccgatg tatggatggg 
agtttgttta tatcattaat attctcatat 
caagagttaa ttttatttct atttactgta 
ttgagatgac ataatgccta tcagtgcctt 
cctctttcaa gtgtgcaaca tcctgataat 
ttcattataa atgatatttc tatgtgtgtg 
aaaagttatt ttttaatgaa gcatctgatt 



tcactcatag gtgggaattg aacaatgaga 298380 
ctctggggac tgttgtgggg tagggggagg 298440 
atgctaaatg acgagttaat gggtgcagca 298500 
ctaacctgca cattgtgcac atgtacccta 298560 
cttcattgac ttacatgcaa tgaataattt 298620 
ttttattttt ttcttgcaaa attatgttag 298680 
tagatgctat tgattgagtt attttgaagc 298740 
ctttgtaggt caaaactggc cctgggaatt 298800 
ttcgtaagct tactgttggt tatttatata 298860 
tttaggactt aaatataaaa atatgtaaaa 298920 
catgctttca tgtcataacg gatcccaaaa 298980 
gttgttgtct taaaatacat tcctttgttg 299040 
acagcctatt atttttcata tgtgccatct 299100 
atatcctttt gcaaatgagt gcaacagaat 299160 
ttgaacagga ctgttgaatg gcttaaaaat 299220 
gaatatgtat gttaatattc agccacctta 299280 
attatacatt gtgctttgtc ataaatttaa 299340 
atgcctctat gtctttcatt tagcttgtat 299400 
gctcttaaag tttgtgcaaa tgccactggg 299460 
aggtcatgaa ataaacctat taacaaaagc 299520 
ctccccatgt cataatcctt tctcataaat 299580 
gtgttgaaga attgtgctgc aaaatgcatt 299640 
agttccaaaa cctccttaaa gtggaattct 299700 
gtacatataa tctattatga acgtatctga 299760 
attgggaggg aggtatcaga gctaaagagc 299820 
gatttctctt catatgtaat tctgctttac 299880 
taccgcctct ttttttttta gccacaaatc 299940 
tcaaatattg ctcgaagaag cataatcttt 300000 
ttgtaaaaat tagtgtcatt gtctacaaca 300060 
atattcctgc tagggatgag ttagttctta 300120 
cataagggca atggatgatg caggcgcata 300180 
tgaaggtttt ttttcattct aaatcaaagt 300240 
atcctggaag ggattttatt gcttttgaca 300300 
ttaattgttt aaaacaattt ttaaactttt 300360 
cttctttata ttattgtact tgcattaata 300420 
tgtcactatg ttttattaga aatatccttc 300480 
ctaatgtaat tctacagggg tcagatagca 300540 
atgtcctgaa taactcctgc tgtattgcaa 300600 
taatgtttct gcatatagct ccaatttagc 300660 
atcaaaaaat acttattggt tgaacctata 300720 
ggcatgacat tatattttaa ataaagaaac 300780 
taaccctgtg taaaaccata gttaaagttt 300840 
aaaatgttaa ccatacaatg actaatgcag 300900 
aaacattatt aaaataaaat atttatattt 300960 
agcaaacatt tatataagta ggggatttat 301020 
agtactaata gtaactaatt tattgaatct 301080 
caccagaata gttacggaag gtaggtacaa 301140 
gatgcctaga aagagaaagt cactaattca 301200 
ggattttagc acaggtaatc ttgatacaga 301260 
tacatgtatg tatatgaatt caaataatac 301320 
cccttttcca cccctcgagt tccttctccc 301380 
atttatctta aaaatattat attttacact 301440 
atatatcttc atcattataa aataagatat 301500 
ggatgaactg atctaccata tattattagt 301560 
cctaattatc ttttttatta ataaaatttt 301620 
tttgctattt gcaaaatatg ctggaataaa 301680 
aattgtgtca ttattactga agtaaaagga 301740 
taaaatattt caataacaac aaaaaagaaa 301800 
taacttatct gggtaatgag tgtatgaaag 301860 
tgtatttaga atttttacaa tagttttgtc 301920 
tttagggagg gatctaagct ttccacacat 301980 
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gtatttctag 
tgtaatgaga 
gaatgaggta 
tggtgatgag 
tattacttct 
tctctccata 
atatctccat 



gcttctattc tccatagtag tgtaagaatt tcaaatgtgg acacgttaaa 
aggaggatta agtgatatga atgaaactgt tggggacctt acattttatg 
gctgaaaggg ggttctacaa agaagtgtga taatagagga cagatttttg 
atgtcattcc aataaggact atatttatct agggatattt tgaaagaaac 
gagaatgaag agaaatagtc attagtgcaa acatgtcaag aaagaaaaca 
ttggagatat tttctctctc tctctctctc cccctgcctt cccctgtccc 
aggctctatt tatctatcca tcatctctgc atatgtcttt atccatactt 
actcatatct ataccgatat ctatatttat atctgtaagc tgatcttagc ctactcatca 
tggtaacata tggcagattt gtgtatacct gttatcagtt cccaaatctt tacacacact 
tctctgtcat catgattaag gggagtaatg tgctcttacg tttttgagaa ttgggcccaa 
tggggacagc tatttttctc tgttataaga ttccatagat tgtggcaaca atggcaccat 
gatgacttct aaactcttga tgggagactt gcttctgtga cttaatggat tattagccct 
gcaatgggaa ctacacaccc aaaggccaag aggaacatgc aaaggcaaac 
gagagcccat caatcacgag aagatggcag gaagttgtgc cgttcccatt 
ggacagctta agattgaaat ataatagaaa gttatatgac gtactctaca 
tattttatta ttattattgt tttgtaatgt agaaaacatc tacccctaaa 
taataggtta ctagattttg taactaaata agataataga caaaatataa 
aaaagtcaga aatgatcctc atccatttta gaaacccaat caatttcaat 
tctcttcctt cttcaacaag gcactacgtt atcctcatcc ttccagcttt 
gctttgatca ttcagggtta gtcttcattg cccatacttg ttcctatttg ttttaaataa 



agagagacaa 
gcagttattc 
gacaagttta 
aatctgtgga 
tagatataga 
ttaatccaaa 
ccttcccaca 



agtcttaata aattcttcct attgggtggt 
taactttaat tactgaacaa tgaagcatgg 
ttaattatac atcatggtcc aagggctcta 
taactatcgg tgtacaaagt cccacagatt 
ccccagtagt tcagtggcta aagtgggctt 
gaaagttttc tggctagaga gttaacagct 
atggctgata aaagtatcaa tgttgtatat 



aataagatat attcaataag gtgaaatgtc 
tagaggttta tactttaata atataattta 
ctttaattaa tattttccac tagcaaaaaa 
agacaccaga accttaggca ggtagtaagc 
taagtaccta aggtttatct ttattcatgt 
gagtgcattt ttagtaaaat actgcaggaa 
ctgcttaaag gagatgcata tctgttcttt 



ctcaaaccac tgttaagctg tcacccatag acttattgta gaaacaatta tctcacagta 



gttttatctc cagacatttt tctgcccaga 
tagtttgttt tagggaggat tgtgttggtt 
tcaggtatac aaacacgtga atgtatttgc 
ggtatagttt caaattgctg aagccaaata 
ttgtcagaat gttggttctc tctcatattt 
aactaatagg tttttatatg ttctaattat 
agtggtaaga gagaaacata tggttagaga 
actgaaaaag tctgctgtga atacagtggt 
gcccatgcaa acagaaaaaa aaatggaatt 
attgacttac acggattaca aatttggtat 
cactctttgc atttttttgc tacatcatct 
agtcaagtta atatttaaag tcacaaagag 
cgtacctgtt gtactatggc atatgtcttt 
aaaaaaggaa ataaaaaatc ccaaaaaaca 
tcagggttga tattcctcaa aagaagacat 
ttcaacatca ctaatcaaaa atcaaaacca 
tggcaattct ccaaaagaca aaagataacg 
attgaacatt gttggtggga atgtgaatta 



caaaagaatc tatggaagta gtcatttctc 
ctcctatgag aaatcacttg cacattatgt 
tatacagagt ccatttctta cacacagcaa 
aaccaaatgg gaagaaattg atggtaaact 
atgaatctgt caaaggatag actttttgta 
ctatcctgtc ttagcaatca 
atatgttaag tgaataggac 
tgtgagatta agtagtaaat 
gtcacaaata taatttgttg 
ccgggcttag tgaataacat 
ttttaaagac tatttatgaa 



gcaaggaaat 
tcagggaaga 
gggcagccat 
gtaatataaa 
attgagcttt 
ggccacaaaa 
tcctagaaat 
agtcaactaa 
aataacccaa 
acaaatgaag 
cagtgagata 
tgtttgcgag 
gtatagcctt 
tttgatccaa 



tttgaacatc acactgactt 
ttatttttct cgttaacaaa 
taaaaataat gagtaaaata 
agcaagtata tgaaaaaatg 
acagctcacc ccagttagaa 



gacgtggaga 
tacgaaaaac 
cagtcctact 
gcgcttccat 
ggacatcaat 
caaccataaa 



aaagagaatc 
agtatggaga 
actgggtaga 
gtttatgaca 
ggaaaaatgg 
aagaatgaaa 



agtaaaagaa accaaacata 
atacattggt cacatgaagg 
gagaaatggg atggagagat 



ttcctcatga aaacaaaaat agaagtatca 
tatccaaaga aaatgaaatc ggtatgtcaa agaagtgtct 
gcattattca caatagccaa ggtatgaaat caacctaagt 
ataaagaaaa tgtggtatgt atacacaatg caatactatt 
tcatgcgaca acaccaatga acctggaaag cattatgtta 
ggaaaacaaa caccgcatgt ttcactcatg tggaattcaa 
tagagtacaa tggtagctgt cagaggctgg gtgtttaggg 

gtcggtcaga gcatacattt acatttgata gaagaaatga atttcaagag atctattata 
tagcaaggtg actctaatta atgaaaatat gattctattc ttgaaaaatg tagaatgaat 
gtaatatgcc atcaccacag aaattgtaac agtgcaaggt 
agatttaaca agtcaacaat gaatatacac tttataacat 
acatacaatg ttatctgtca attaaaaaaa tatctaggaa 
gggtatttat aaaatgcata tttactttgt aagtcaccaa 
gatttatttt gcttaataga atagaataaa taatagaata 
agtgatccct ttagaccttg atgattttat aaatataggt 



aatgcatttg tgaattagct 
catgttgttc atgataaaaa 
gaatctgtac agaagattgt 
atttgtgaat aacattattg 
aattataatt aaatatttaa 
gagatgctag tttgcttacc 
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ctaaatatct 
acttccatca 
atgtgtgtat 
cattagtgat 
tataagcaaa 
gaagcagtac 
aaccagtaag 
ccctcccaac 
aagtgtgttt 
agatacagaa 
tgatttgact 
gttgactctg 
ttgagaaagc 
ttgcatagta 
aagttactca 
actcaaagct 
cgaagcaatg 
tacagcatgt 
ggaacacagt 
tgcatcaccg 
tcctcaccca 
gaagtatttt 
gtttccccca 
tatttgttag 
tgtgatctcg 
ctcccgagta 
taatagagat 
ccgcccacct 
atctgatctg 
tcgctgtgtt 
gtggcctccc 
ccagtcttga 
tcaaggaaag 
aaacgtggct 
tcaagttgtt 
gaaatcttta 
ttaatgttct 
cctggaaaac 
acgatatata 
gagaatgcaa 
tatccattca 
cccccgaacc 
atacagatta 
tggttttctt 
ccttgacatt 
ttaatttagt 
ccttaattaa 
gtatgtggtg 
attctccaat 
tattgatgtg 
tatatgaata 
catgtgcact 
tgaatatgga 
cacacatagc 
aacattggaa 
agctggaatt 
agtctctttc 
aggaagatgt 
tcattgtccc 
gtcttattgg 
aagaattttt 



attttctgcc 
ccaaaattgg 
ttttagcaag 
gcaactgact 
atgacatatc 
tggctaaatc 
gtagagttcc 
tagaattctc 
tattagcttg 
catatgaaga 
gttattccct 
ttccaatcct 
atgacctatg 
caagcctcac 
tctctggaga 
agcacttgca 
aagtaatagt 
taggatgggg 
gcctgacaat 
caagtacctt 
aatctcatct 
tccccacatg 
tgccgttctc 
tttgttttga 
gctcactgca 
gctgagacta 
cgggtttcac 
cagcctctca 
atggttttat 
aagaaggttc 
cagccacgtg 
gggtatttct 
aaaggtacat 
tgatcatgtt 
ctagaaattc 
cagtgaaggt 
agatgtattt 
atttagcctg 
ttttgaaaaa 
ctaatctcaa 
taaaattgat 
acaaaaatgt 
agtaaacatt 
actgggttca 
ttcactgtct 
tttacaatct 
aataaacttt 
atgagtagtg 
tattagctat 
gaaaaagggg 
tgtaaaaatg 
ttgtgtgagg 
cacaaataca 
cctttgctaa 
ggccatgtag 
ctattctact 
tttttcatgt 
atgaaagcac 
ttaacagatg 
atgaaatgat 
gcttcagctc 



ccagaaacaa 
caattaagaa 
aaagccagtt 
tctgcacacc 
attacacata 
catcagtgta 
cgctccttga 
aaatgtgttt 
ggattccatc 
atccctgagt 
tcactcactg 
cagagcaatt 
aaaatgagtt 
tgatgatcac 
ttgtatccca 
ttagtattag 
agaaaattcc 
accacacctc 
aagtatgtgt 
tagaccccaa 
tgaattgtaa 
tggaagaagg 
gtgatagtga 
gacacagtct 
acctctgcct 
caggtgtgtg 
cgtgttagcc 
aagtgctgag 
aagcatctgt 
ttgcttcccc 
gaactgtgag 
ttatagcagt 
tgtagaatat 
tgttcattga 
caactattat 
gaatctggta 
gataagaacc 
tacaaagatg 
aacatttagt 
aattttaaat 
tgcaatatgt 
gtttatttat 
ttagtaatga 
gaaagagaag 
tgctgcaggg 
cattgtatct 
tggtatttgg 
catgaacttt 
gtcagttagg 
caataaatcc 
actaggctag 
gagtgtaata 
gatacatctt 
aatgtgatca 
cttaagatta 
tgtttcctgt 
aaatgtacat 
tgtcacaatg 
tctttaaaaa 
aataactgta 
cttgaccttg 



tcctttaata 
tattttcaca 
ttttgagctg 
atcctacaga 
ataactcatt 
gtattggctc 
tataagaata 
gtcggtggct 
tgtgaagact 
tcctacatag 
gaaaaatagc 
cttgccaaac 
gaagctccaa 
tatccaccat 
gggtctagca 
ctgagaattt 
tcttcatgtg 
ctttttactc 
tgagtaaatg 
agagcataat 
tccccagttt 
gacctaatgg 
gttcttacaa 
tgctctgtca 
cccgagttca 
ccaccacgcc 
aaggcggtgt 
atgacaggtg 
catttcccct 
tttatctttt 
tcaattaaac 
gtgaaaacgg 
caaaatatct 
gaacccactc 
tgaggcattt 
ataatacgta 
ttattctcag 
tatatttgtt 
ctcctaaaat 
aaataaccag 
tagtgctggc 
ttattcattt 
gcttacagat 
tatttttcag 
agttcaggag 
ggcaatttaa 
atagcaggat 
ggaattcaca 
gcaaattgcc 
tatcattcaa 
tccctgacta 
tccaatttct 
taccatttac 
tggaattagg 
aaaacataga 
cctatcttga 
atccatgtca 
accaataaaa 
tatattaaat 
cacatgggtg 
aggttgaaat 



gaacacttta 
cagaaaaaac 
tatgatggag 
agaagaaact 
gtggtgggtt 
tctattctac 
aggattccac 
catccaatat 
ggcatttcta 
ctgtcaagaa 
tcttcccaga 
ttcttttgag 
accagcccat 
atcttagaga 
caagtctcta 
tttaaaaaaa 
aaattcctgc 
acccattata 
aaccgctgtg 
acagatatgg 
tcttactggg 
gtaactgaat 
gatctgatgg 
cccaggttgg 
agcgattctc 
cagctaattt 
cgatctcctg 
tgagccacca 
gcttgcactt 
gccatgatta 
ctcttttctt 
actaatagac 
cagtaattct 
tattggcttt 
gcgtattaaa 
acaacattat 
tattctgagt 
atatttttct 
tataaaaaaa 
tgcttatagt 
agcttagtat 
taaaactact 
tttatacgat 
tggggattgc 
catgtataag 
gacatagaaa 
aatgaatacc 
gaccttggct 
ttaattcttt 
agttgtaatg 
tgggacatag 
gtattatcat 
agactacttt 
atttcataaa 
atttgcagac 
gcaagccact 
tagatttgtg 
ggagaggatt 
ttgaaacatt 
aagattagtg 
ttaagcctcc 



cctttcctta 
cactattgat 
ccttttcact 
catgttcgta 
ctaagcaagt 
ctgttcttaa 
cttctcctgc 
tgcccaagga 
tcttactaac 
gtaaatgaac 
atcaagtatt 
aggtgctgcc 
accacgtgat 
tgggatttag 
cacaagtggt 
aatcggagaa 
taacatgttg 
gcccaacgat 
atcctcccac 
tttggctatg 
ttcagaaaga 
aatgagggca 
ttttgtttgt 
agggcagtgg 
ctgtctcagc 
tatgtattct 
tcctcgtgat 
tgcccggctg 
ctctctcctg 
taagtttctt 
tataaaataa 
aaagaaagcc 
ttttcatagg 
gtatcttgct 
ttacttttta 
gctaaccact 
aaactcgggg 
taattgaaag 
tcattgttta 
gtagttgtat 
aacacatttt 
ttatagtaat 
catgcggttt 
ctagctgcaa 
actattctat 
agtaactttt 
cagacaggca 
tagatttcag 
aaactcaatg 
agcagtaaag 
catttatttc 
actgcaacta 
tccatgtaca 
attgaattat 
agacacaatg 
caacattttc 
atgtctatta 
caataaatgg 
gtttcagagt 
attggaagtc 
tcatctagtt 
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tttttaaaaa 
cagctccctc 
tggcatgctt 
aaagattatc 
aaaattgaat 
tttgcactaa 
attatttatt 
tggcccgatc 
agtctcctga 
ttttagtaga 
gatctgccca 
accaattttt 
ggtcaatgca 
taaggttctt 
atggaataga 
aaagacctca 
caaatcttcg 
gtatgtccca 
cgtgaaagac 
aatggccatc 
gctgtccttt 
ggttgagagg 
attccctgaa 
gtccactaca 
ctgatatccc 
attatggtca 
cttcttatac 
ataaaaacaa 
caacaaaaaa 
tgtggactat 
gagtaaatta 
tacatgtata 
tagatagatg 
gacacatctc 
tactatgtat 
tatcttagca 
tcattttcct 
cactgctttt 
ctgaaatagt 
gacagcctgg 
caaaacttca 
ctggaatttt 
aaagcaaaac 
gtctctcttt 
tttcctcatt 
atgacccagt 
catgcctaaa 
aacttccaat 
ttcaagatgt 
cacaagggct 
gtttgcatta 
gttgaaaata 
attttttaac 
atcaactacc 
catcagtctt 
ctcaggggtg 
tccataagac 
atttcatcac 
atgtgtccat 
agacaattga 
agaatgaacc 



acgtttctaa 
aagcttaatc 
gaacattctt 
ttgatagctt 
aatgttgacc 
ttgcagtttg 
tatttatttt 
tcagctcact 
gtatccggga 
gatgaggttt 
cctcggcctc 
tatttttatt 
actgtggtaa 
ggttgcaagt 
tgtcagagac 
aaggcattaa 
ctccaccact 
gtattcagat 
acataataga 
acattataca 
tgaatttgca 
agaacagaag 
actaagaaga 
gttagaataa 
acatggttaa 
ctgtgattgt 
ttactggtcc 
taatgcattt 
ccagaaagtc 
tttttgactc 
gcatcgtccc 
caaatatata 
tagatatatg 
aagaccctca 
aatttttttc 
acctcagcat 
taatggaagc 
gtgctttgag 
caatctgata 
agaccctgaa 
tcaccatact 
ccacttaata 
tttgaataag 
cacacacata 
tccatacttg 
acatactccc 
gattctaagg 
taatgaacat 
tgtttcccat 
acacagtaag 
tattctttgt 
tctgaagaag 
agcagctgtt 
acccccccaa 
gctcctctat 
ccattagctt 
ctttgtcact 
agaacctgct 
tcttgagccc 
aatattgatg 
aaaacttgaa 



cagaaatata 
tgtatgaact 
tgaataagta 
atacttctgt 
aatttaatga 
taaagttatt 
tgagatggag 
gcaacctctg 
ttacaggcac 
cactatgtta 
ccaaagtgct 
gacaatgctg 
agagaacaga 
gataaaaatt 
aaatgttaag 
cctgcagtgc 
taatagaact 
cttcaaaatt 
tcttcaaaaa 
ccatggactc 
tacctccaca 
ggatgaaatt 
ggagggattt 
ccatatatcc 
acacaccttc 
cattcactcc 
agtgaaaaaa 
attcttttca 
caaaatacta 
taattttttc 
cacaacaata 
cataaatata 
aaaggtatat 
gtggatgcct 
tttgtcacca 
atgatttttt 
actttacagc 
gccattatta 
gctggggtgg 
caaatgaatg 
cagaacagca 
tttttggacc 
gagaaactac 
tacacataca 
attatgaggc 
tctactttcr 
gtttttcttt 
gctggttctg 
ctggcactat 
aaccttgaat 
ttagtagcaa 
gtacagacag 
ctactgaagg 
cccccccctc 
cagggcctct 
ggcaacctgt 
tgacttaata 
gccatgttgc 
tggcacttgg 
acaagaaaat 
gatatttttg 



ctttgtggat 

ttggggaaaa 

gaaaatattg 

ttttaatgta 

gtgtctgtaa 

gaacttcctt 

tcccactctg 

cctcccaggt 

gtgccaacac 

gccaggatgg 

gggattacag 

ctttggataa 

aatgatcatg 

taaagcagga 

gttctgaaat 

ttctgaacgt 

ttgtgacttc 

agactaccct 

tgtccagaaa 

cactggacgt 

aagttcctga 

tctttacttt 

ggatgttaaa 

tagctacctt 

gtggcagaag 

cttttgctgc 

gaccttaata 

gcaaacaaaa 

ttattctgct 

taatcctatc 

ccctacatac 

tatatgcata 

ctatatacag 

gaaaacgtgg 

tttcacagat 

tccctttctt 

ttctgtttgg 

agtcaaataa 

ctagtgagtg 

attcatcccg 

cacaatttaa 

atggtttact 

tctgtgtgtc 

ggaaaggaag 

caaattgagt 

gggaacactt 

ttacatatag' 

tacaccatct 

agtgagtttt 

cctatgggaa 

tgatatataa 

caaaggcaga 

caaatcactg 

accacacaca 

aggttgtgct 

ccattttacc 

aggagacaca 

ttagaaggag 

gttgtcttga 

ttggggaaga 

gatgtttaaa 



cattatacct 
gataagcact 
ctctttcatt 
aataataggt 
taaggggaat 
ttatacttag 
ctgcccagag 
tcaagtgatc 
gcccagctac 
tctccatctc 
gcgtgagtca 
agaaagttac 
cacaatagca 
aaataatcta 
tgggcaataa 
gggctctgga 
gggcaagtga 
tacagctaac 
aaaataaatg 
ggttatggaa 
cctatattgg 
ttctctaagt 
cagccacaca 
gagaaagtcc 
tgtcacaatt 
taaaatgtgc 
acaaaattaa 
caaaaacaca 
atctctaagt 
tcagtatttc 
ggtatctata 
tatacatatc 
taggcctcca 
atagtgccga 
aaaagattca 
tattaagtca 
catagccaaa 
ggatgacttg 
actaatgggt 
agtttgatgg 
aacttatgaa 
gcaggtcatt 
tacatgcaca 
agaagacatg 
atttatgact 
ctgtaatatg 
aatgaaaaga 
accacccaga 
ccatgaatca 
taagctcaat 
aagttgtggt 
cacagaagaa 
cactctccat 
cacgcagtac 
cagtatgaat 
atgtcctagg 
cagtggccag 
aaggagctag 
tggtcaggaa 
gacatgtggc 
aaaggatgac 



cattctaatc 
gtgaataagt 
ttattgacag 
atcaaattct 
ttataaatgt 
atgataattt 
tggagtgcag 
cttctgcttc 
ttttttgtat 
ttgacctcat 
ctgcgcccgg 
tgaacgcctt 
agatttcagt 
ttagaaggct 
gcaaggaagg 
gctgaagttc 
cttaaccttt 
tgagttaaca 
tggccagaac 
aactaatctc 
attttgcctt 
gaaaggtata 
ttttgaaaat 
cagtttacaa 
tggatgataa 
atatattctt 
attttcattt 
acaacaacaa 
cttggatatc 
atcacctaca 
tctatttata 
tatatgtata 
ttacctcaga 
accctatata 
ttttgctgta 
agaacttgca 
tggccagtat 
gacataagca 
ggggagcaga 
agaaagagca 
ttgtttattc 
gaaacctaaa 
tgtataagta 
tggattctag 
tgcctccttc 
ctagaatttt 
gtctttcatt 
acgctgatct 
tccagtattc 
gcctaacatc 
tatagagttt 
aaatatatca 
gattcaagaa 
tatttgctag 
gaattagaca 
catgtttcta 
gaaaagagat 
atatgtgact 
cttggaagga 
tataactctc 
ctcagaatag 
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gaggatttta acaattaaaa gaataggatg acttgttttg tgggtgccag tcagcctctt 313020 
tactcagtcc tccctgtcac ttcccattgg gctcatgact aaaatagcca ttgtgtcagg 313080 
gatggaggtt acgcatgggc tcagctacat ggacttacac tcaccaaggc tgccttccta 313140 
ctatcagtct accagctgta gagaccaaca ctgtgcaccc aatgtgacac tattccccag 313200 
gggatcaggc atctacctgg tggcaggcag atcacagtgg acagcctcca tcatggaaag 313260 
ggaagtattt tgtttctact ggataagatt ttggatgtgg atgtgtctgt cctgaacaca 313320 
atgcttctgc caaacacgga cttacagact gccttatgca ctgtcgcagt attcctcacg 313380 
gcatggcttc tgatcaagga actcacttca cagaaaaaga agagtgccaa tgggccatgc 313440 
tcatgggatt caccggtttt atcatgttcc ccaccgtcct gaagcagctg gcttgataga 313500 
atggtaaaat gaccttttga agacagctct agccccagct aggtaacttc acagggcggg 313560 
ggcgagggtc tctagatggc tgtaaatgct ctaaactaac tatccgatat atggtgtcat 313620 
taatttttat ccaaattcat cttcttcacc tgtgtattat caatagaaaa tagcttaatt 313680 
tgtcaatcta tcttgggtca aagtttacca taaagctttt catgaatttt agtttcaaaa 313740 
atttacatct tatgtaatag tgtgtataca tttatatagt aagaaataat atagtaaaga 313800 
aaatcttaaa cacatggcta agtttggcag aatgtcataa agatcacatt gtacaacaaa 313860 
tgccagaaat tatagtgctg tccataattt tgtttctttg agactgattc cactggtttt 313920 
caaatattgc tacagttgaa acaaaattaa aagaataaaa actcagaatg atcacataga 313980 
gtggctaaat taatgatatt caagttctgt ttcttcatct ttataatcac ggtgttataa 314040 
ttgtttattt caaatcaacc gctcaaacac aggaatccat agcaccaaaa atttttgata 314100 
cagaaacttc actgagggca agcctaaatg atcccaaatc tctatgaacc ttttcttgaa 314160 
atgaatgcat gtaactgagt ccgtgtttgt caaggccatt tgtaaaactg gaagttctct 314220 
aatttaacat ccatgtaaaa tacaatattt ataaaacaac ttgtaataaa tgtatgttaa 314280 
tttgatgtta catatatatg tatatatnca attacatata tatatatata attaaattca 314340 
agagaaacca agtgattgtt taggactgag accaataaaa tatttaataa aatatttttg 314400 
gagggaggac ttgtaatcac gaactggggg gaatatccac gggcaaacca tgaaccagat 314460 
tcaggacaat gtcgttgtcc tctgaatccc aggtaagtcc actttgactc ttaaatggca 314520 
gaaggactct gggttttcaa aacccataag cctgttatct aacgtatttt cgtttcacca 314580 
gtgcacagct cctcttgttt atggatgcag atgcctttcc agaaggctag aaattataga 314640 
aataacattt ggtattttca gagtctgtat atggaagtac ctacctacca tgtttttatt 314700 
taaagaattt tgaatatgtg atttgactca aatttgggaa ttgtgtgagg aacaaaacta 314760 
ttttggtggc ttcaattagg tctatatgca atagaacgtg aatacttttc aaaaacagaa 314820 
atagaacagt gcctacgtga gctagcacca attatggtcc ttggtactcc atggcctatt 314880 
tacagctaca ttaatccatg agtcttataa ttcaaatggc tccagtggct ttgatgttca 314940 
ttagggacac aacagccctt tgccttccct ggagaagtgc tgtgttttgg gttctaacaa 315000 
cataaatact ccccagtccc agtgttttta tggagcccaa ttcaatgtac gatctccccg 315060 
tggcatcacc tacctataat gattaaggat ttgggagatt catttagtga attttatcaa 315120 
gataacaatc ttaacaaaca ctgtagcata tgcacactcc aaaggctttc cctctttcta 315180 
cgttaaatcc aatggattcc tgggactgtg tctcttatag gcttttactg ggcctaaatt 315240 
ttagccaata agcttgtcaa acagctagaa acacattaaa ctcttcaagc ttacctactc 315300 
tatccatgct cttcgggtaa gtgtccactt tgaaattcac cccaatgcaa tcttttatct 315360 
tggttggatg cacaccttca gaacctattc ctaagaaatt cttgcatata ttgttggcat 315420 
aaatgtcttg caattctgtt gttatatcat tttttttctg agtatatttg acccctattt 315480 
aatgcttagc tcttgctcaa gttctaggac tgcaggcaat gaagactggg taagtagggg 315540 
ttaggatgag aattatcttc tcagtctgag gtaacttcct catatgtaaa ggctactaag 315600 
gcaggtttgg aaatgggttt atatatgcat ttccgaatat cctaataata tttggctgct 315660 
tacattagta aaaatcatta aagttgcttt cactcaaaga ctgtaagata taaagaaaag 315720 
ttatttaaaa tgtaaagcat tgaacagaaa acaatcggaa actaatcaat gaaatgggta 315780 
tatataggag gattccaaaa attcggagac tctaaccagt gaatccagcc ttgctagtac 315840 
atccctgcag ccagatacag ttgcacttgc tggaccacta caacatcggg tgccaatcta 315900 
accactgaat ccaacctcct tgtacatcac tgcaaccaga tgcagttgca cttgctggac 315960 
cactacaaca tcgggtgcca atttaatact ttcttctttt cactctctac cccrgtttca 316020 
gtatccttgg cagtatgcta tgttcaactg actttgagat gtatgattct attttgacac 316080 
caacaggctt agaaaagagt aaactgaaca ctctggcttt tgtagggaga agtagaactc 316140 
cgtgtagcac gaagtcttat gcaaaagtag gaagcagtgt aggaggattg gcagaccaaa 316200 
aaatgaaaag tgacaaatat ttatatacag ccagataatt ttgttataag tcaatactct 316260 
agatttttta aatctcccat ctccaaggca atttaggcaa cagcaagtat cttataaaat 316320 
tctctttttt tagaaaatat agatgaatta tttaagatgg attatcatta tttattgtgt 316380 
atttagtata ggtttattta tctcacactt ctttgtcttt ttccattatt ttcagctgtt 316440 
caaaaattct gcatttgtaa caacttcctc aatatgtaaa tttaaaaatc cctcaatatt 316500 
tgccatctta ttctgaagtg tttaccatct tactctaaag taatctgaac accacatgca 316560 
ttattgttgg agttttaaaa cattagtatt tatctgatcc taatgtttta ttgtaattgt 316620 
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atagattttt tcactaattt tggaagggta aactattgtt tcatagatgc aataaatcct 316680 
ggaaacttgt ttgcaatact tttaaaagca gttttcaaaa tttctcctct atcacacagg 316740 
agacacactg taatgactac tatttcctct gatttctttt aggtgcgttt cttttgactg 316800 
tattttcatc agtctcatca tttttctgtg tcatccattt gcgctgggac ttgtgcctac 316860 
tgatggtttt caagtgtggc tgcaatttgg ttcccttgaa ggtattttaa aatatgctga 316920 
tgcactagta ccattcccaa accaactgaa acacattctt caatggcaag gtcatggtgc 316980 
tggtatttaa aacaaaaatc ttcctaagtt attttaatat acaactacaa catattcact 317040 
ggtatagata atattgaata caaaaaaaac ctatatgatg actcgttcct tctgtgagag 317100 
agcaaagtcc acaatttaaa aaaattaata atactgttaa ctcagaggga tcagaccagg 317160 
tggaaagcac gaatatgcac tctgcataat gaagcttatt tccagtgtac aaatctacag 317220 
caagcagaaa aggatacttg gtgcagtttg ctctagatac acagctaaca cagcgagcca 317280 
tgcagtgggg cctccaccta gaggaactga ttacaggaag aggaagtttt gatttcaacc 317340 
ttgttaggct ttccttccag ttagtaatgg tttcaagctg aatggatgac acaacatgaa 317400 
aaattcaact acttaagaat agcatgcgcc aagtatattc tataactttc tgttaagatt 317460 
catttcactt aataattttc tatataaatc tctattaatt taagtttctg aataactgta 317520 
gggtcaacaa catatttggt actctcctac tttgagcttg gttgaaccac ataatttgaa 317580 
agaggatatt ttataaagtt aaaatcctga taactttaaa gactgataaa tttctgtttg 317640 
cattcagaat atagtggcag aaaaaaatat tcctttctaa attataatga ttaaattgtt 317700 
ttgatggtat tatgttgcta taaataattt cctagagata ttaaggtata cttacataag 317760 
atttattcac atttatgata ttttaggctg accttaaaaa ttatttatga atttaattgc 317820 
ttatgcatat gtgataaatg ctaattataa tatttacttg gctttaataa atatttccag 317880 
ttgcaatttg ataggctaac tgtatgtaaa aatttcataa tattgtctag aatagttttt 317940 
ctttattctg atccttttta gttatagtct tatagatatg gccttagttc aaaagcacta 318000 
tacactcgtg aaacatttct aagtcaatgt aaaatattgt aatgtgatgt cagaataaaa 318060 
tctttttgaa gatagagatt ttcatggttt ttttccaaat cagagtttaa agtagtcaat 318120 
ctcatgattt gtcagacttg aaaagttata gaatgaatat tttaacataa aaataactta 318180 
aatgaatatc cttagaatac aaatcaactg tatattcact cgaggcactt aaaatattct 318240 
atatgccaaa aactgcatat ctctttccaa tttttaatac tgctttctct ctctctctct 318300 
tagtttacaa atatttctca atatttaaac ttttagaact gaatcatgtt ggggaaatga 318360 
agtctttggg gaaataattt attatgaaaa atttttatgt agttattaaa aaaattctaa 318420 
ctcagttaaa caatattata tggaaatgcc atagtttgaa aaattgtcaa ggttaagata 318480 
aaacaaaaaa atatttttga cttttttttt cttttctttt attattatac tttaagtttt 318540 
agggtacatg tgcacattgt gcaggttatt tacatatgta tacatgggcc atgctggtgt 318600 
cctgcaccca ttaacttgtc atttagcatt aggtatatct cccaatgcta tccctccccc 318660 
ctccccccac cccacaacan gtccccagag tgtgatgttc cccttcctgt gtccacgtga 318720 
tctcattgtt ncaattccca cctatgagtg agaatatgcg gtgtttggtt ttttgttctt 318780 
gtgatagttt actgagaatg atgatttcca atttcatcca tgtccctaca aaggacatga 318840 
actcatcatt ttttatggct gcatagtatt ccatagtgta tatgtgccac attttcttaa 318900 
tccagtctat cattgttgga catttgggtt ggttccaagt ctttgctatt gtgaataatg 318960 
ccacaataaa catacgtgtg catgtgtctt tacaacaaca tgatttatag tcctttgggt 319020 
atatacccag taatgggatg gctgggtcaa atggtatttc tagttctaga tccctgagga 319080 
atcgccacac tgacttccac aatggttgaa ctagtttaca gtcccaccaa cagtgtaaaa 319140 
gtgttcctat ttctccacat cctctccagc acctgttgtt tcctgacttt ttaatgattg 319200 
ccattctaac tggtgtgaga tggtatctca ttgtggtttt gatttgcatt tctctgatgg 319260 
ccagtgatga tgagcatttt ttcatgtgtc ttttggctgc ataaatgtct tcttttgaga 319320 
agtgtctgtt catatccttt gcccactttt tcatggggtt gtctgttttt ttcttgtaaa 319380 
tttgtttgag ttcattgtag attctggata ttagcccttt gtcagatgag taggttgcga 319440 
aaattttctc ccattttgta ggttccatgt tcactctgat ggtagtttct tttgctgtgc 319500 
agaagctctt taatttagtt agatcccatt tgtcaatttt gtcttttgtt gccattgctt 319560 
ttggtattta gacgtgaagt ccttgcccat gcctatgtcc tgaatggt 319608 

<210> 2 

<211> 2226 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<220> 
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<221> CDS 
<222> 96. .167 

<220> 

<221> 3'UTR 
<222> 168. .2226 

<220> 

<221> polyA_signal 
<222> 2200. .2205 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 2 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctgaac attcaccctg ggagctctgt 277 

gctgccatgc aaatgtcaca ttgeactgag gccgccatgc tgtaaaaaac cccaagctac 337 

ctacctgaaa agctggagag aatcagcacc catcagcttt tccagcattt tgggtcctcc 397 

cagctaaatc ateggcattg tgaagcagag atgagacctc ctgccatgcc ctttccaaat 457 

cctggaccaa cagaatcatc atatacacta ggttttgtgc atctctaaga tgatcatctt 517 

ggtgtggggt cctgatcttc ccagtgactt agcagcacag taatttggct gaaatgttga 577 

tgagttcaga caagaagaag gtgatgatga caagagagcg teaeggggaa attctgccct 637 

gaggaaggaa tagtttgagg ctggagagat gtgctacccc cgccactctt tctcaaatgg 697 

aaagacaacc tgtaggcaca gaaatcattg ctctgcttac actgaaatct cctgctgact 757 

ttgagagctg agcattggat tttgatggga gagtgaaact tagtgaagaa atgatttgaa 817 

cctttgttga tctaatactt ctgtgataca gttatctaaa cagaatgagt ctaaagtcag 877 

tataaagacc tgtgtctctc catgttttgc tgcttataat tgcagaggaa cattttgaga 937 

tagaaaattt tatccctctt tcaagtgact ttttataaaa cataaccttt taagagaggt 997 

cattcttttg aaaaattgtc tgaatacaaa aaagtaactg aacagcaagg gattgtagee 1057 

ataatgttct tgaccacttt agattgacaa atagtatctc aaaattgatc aggttctgga 1117 

gaatctattg aggttgtatg gagtaaccaa gtgatggaag aggttggaag tgaataattt 1177 

tatgtggtat tatgtgattt tttttggtgg gggtagtgtc tggagctgag catgatggag 1237 

cctagatggg gcagccttct aatgctccca gtatttcgea aatcatatgt caegtcaaaa 1297 

ttgtaatgac aaagctcaca gcagctgcct aegecaggtt tggtgccagc tctaattaga 1357 

gcattgatca tttctctccc cttcatgaat gcacatgagt gcacattttc atatataagg 1417 

gcaaaggaca tttcaaagtt tggatggaat ctttcctact tagaagtaat aaaaaacaat 14 77 

actttgaaat cttaccaaat gttaaacaaa tagcaacaat ttttccattt gaagatgtgt 1537 

gtcttaaggt ctgttggatc tttcaaattt ctattttgga catggaaact acagatgaaa 1597 

tttaacttac tcaaagttac acagaactcc tggctgatct cttttactta attcttctaa 1657 

aatctaaagt aagaataatt ttgaaatttt attggtaatg cttaaataac aaaagttact 1717 

ttaatatgtt acatgtttga agtttcattt atacttaata tctaatttaa cgcttgtgaa 1777 

aatattattc agtatttttt tttcttttct aaaatgactt gttcagggct ccttttggga 1837 

ttttaagtga gaaaaactct atttcataat attgttacca tgctcatcca ataacacttc 1897 

atagatggtt agccatctct aaaattaaat tttgtagttg aggagcattg aaaatatcac 1957 

ctatatttgg atttctgttt aacacttcaa aatgcatttt atcataattt aaaatttgat 2017 
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ttaaatacaa acctctttat agaaaacctg ataaatgtag ttttatttgt ttgcttgctt 2077 

ttttttgcag ggagggatat tcccttgata ccactctgtc tttatgactc atcggttatt 2137 

tataaatgtt atcatcgttt ttctgatttt taaataagtt tcagaattaa tacatttaac 2197 

tgaataaatc aaccaataat gtcaaactt 2226 

<210> 3 

<211> 2431 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 r UTR 
<222> 1. . 95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3'UTR 
<222> 168. .2431 

<220> 

<221> polyA_signal 
<222> 2405. .2410 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<220> 

<221> allele 
<222> 328 

<223> 8-155-258 : polymorphic base C or T 
<400> 3 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtccttgaa ggtctccctg ctgtgcttag 277 
agagggggat gctggaagaa gagtggtcag agaagcaaca ttgetgaetc yaaagacaga 337 
gggcaggeca egagecaagg catgeaggag gcctccagaa gctggggcag gcaagagaat 397 
ggactcagcc ctggagcctc cagaaggcat gcagccctgc ccattccttg ettttageca 457 
ggaacattca ccctgggagc tctgtgctgc catgeaaatg teacattgea ctgaggccgc 517 
catgctgtaa aaaaccccaa gctacctacc tgaaaagctg gagagaatca gcacccatca 577 
gcttttccag cattttgggt cctcccagct aaatcategg cattgtgaag cagagatgag 637 
acctcctgcc atgccctttc caaatcctgg accaacagaa tcatcatata cactaggttt 697 
tgtgcatctc taagatgatc atcttggtgt ggggtcctga tcttcccagt gacttagcag 757 
cacagtaatt tggctgaaat gttgatgagt tcagacaaga agaaggtgat gatgacaaga 817 
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gagcgtcacg 
acccccgcca 
cttacactga 
aaacttagtg 
ctaaacagaa 
ataattgcag 
taaaacataa 
aactgaacag 
atctcaaaat 
ggaagaggtt 
gtgtctggag 
tcgcaaatca 
aggtttggtg 
tgagtgcaca 
ctacttagaa 
acaatttttc 
ttggacatgg 
gatctctttt 
taatgcttaa 
taatatctaa 
gacttgttca 
taccatgctc 
agttgaggag 
attttatcat 
tgtagtttta 
ctgtctttat 
aagtttcaga 



gggaaattct 
ctctttctca 
aatctcctgc 
aagaaatgat 
tgagtctaaa 
aggaacattt 
ccttttaaga 
caagggattg 
tgatcaggtt 
ggaagtgaat 
ctgagcatga 
tatgtcacgt 
ccagctctaa 
ttttcatata 
gtaataaaaa 
catttgaaga 
aaactacaga 
acttaattct 
ataacaaaag 
tttaacgctt 
gggctccttt 
atccaataac 
cattgaaaat 
aatttaaaat 
tttgtttgct 
gactcatcgg 
attaatacat 



gccctgagga 
aatggaaaga 
tgactttgag 
ttgaaccttt 
gtcagtataa 
tgagatagaa 
gaggtcattc 
tagccataat 
ctggagaatc 
aattttatgt 
tggagcctag 
caaaattgta 
ttagagcatt 
taagggcaaa 
acaatacttt 
tgtgtgtctt 
tgaaatttaa 
tctaaaatct 
ttactttaat 
gtgaaaatat 
tgggatttta 
acttcataga 
atcacctata 
ttgatttaaa 
tgcttttttt 
ttatttataa 
ttaactgaat 



aggaatagtt 
caacctgtag 
agctgagcat 
gttgatctaa 
agacctgtgt 
aattttatcc 
ttttgaaaaa 
gttcttgacc 
tattgaggtt 
ggtattatgt 
atggggcagc 
atgacaaagc 
gatcatttct 
ggacatttca 
gaaatcttac 
aaggtctgtt 
cttactcaaa 
aaagtaagaa 
atgttacatg 
tattcagtat 
agtgagaaaa 
tggttagcca 
tttggatttc 
tacaaacctc 
tgcagggagg 
atgttatcat 
aaatcaacca 



tgaggctgga 
gcacagaaat 
tggattttga 
tacttctgtg 
ctctccatgt 
ctctttcaag 
ttgtctgaat 
actttagatt 
gtatggagta 
gatttttttt 
cttctaatgc 
tcacagcagc 
ctccccttca 
aagtttggat 
caaatgttaa 
ggatctttca 
gttacacaga 
taattttgaa 
tttgaagttt 
ttttttttct 
actctatttc 
tctctaaaat 
tgtttaacac 
tttatagaaa 
gatattccct 
cgtttttctg 
ataatgtcaa 



gagatgtgct 
cattgctctg 
tgggagagtg 
atacagttat 
tttgctgctt 
tgacttttta 
acaaaaaagt 
gacaaatagt 
accaagtgat 
ggtgggggta 
tcccagtatt 
tgcctacgcc 
tgaatgcaca 
ggaatctttc 
acaaatagca 
aatttctatt 
actcctggct 
attttattgg 
catttatact 
tttctaaaat 
ataatattgt 
taaattttgt 
ttcaaaatgc 
acctgataaa 
tgataccact 
atttttaaat 
actt 



877 
937 
997 
1057 
1117 
1177 
1237 
1297 
1357 
1417 
1477 
1537 
1597 
1657 
1717 
1777 
1837 
1897 
1957 
2017 
2077 
2137 
2197 
2257 
2317 
2377 
2431 



<210> 4 

<211> 2215 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. . 95 



<220> 
<221> CDS 
<222> 96. .311 



<220> 

<221> 3*UTR 
<222> 312. .2215 



<220> 

<221> polyA_signal 
<222> 2189. .2194 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 



<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 4 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 
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aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 



aag 


cct 


gta 


agg 


aga 


tec 


rgt 


etc 


agt 


tgg 


cca 


ggt 


ctt 


gtg 


aaa 


ggt 


161 


Lys 


Pro 


Val 


Arg Arg 


Ser 


Xaa 


Leu 


Ser 


Trp 


Pro 


Gly 


Leu 


Val 


Lys 


Gly 










10 










15 










20 




aac 


etc 


ate 


ace 


gga 


gtc 


aag 


tgt 


cat 


cac 


ctt 


tec 


cat 


gtt 


cca 


ttg 


209 


Asn 


Leu 


lie 
25 


Thr 


Gly 


Val 


Lys 


Cys 
30 


His 


His 


Leu 


Ser 


His 
35 


Val 


Pro 


Leu 




gtt 


aga 


agt 


ccc 


age 


tec 


tgc 


cca 


cat 


tea 


agg 


gaa 


cat 


tea 


ccc 


tgg 


257 


Val 


Arg 


Ser 


Pro 


Ser 


Ser 


Cys 


Pro 


His 


Ser 


Arg 


Glu 


His 


Ser 


Pro 


Trp 






40 










45 










50 










gag 


etc 


tgt 


get 


gee 


atg 


caa 


atg 


tea 


cat 


tgc 


act 


gag 


gee 


gee 


atg 


305 


Glu 


Leu 


Cys 


Ala 


Ala 


Met 


Gin 


Met 


Ser 


His 


Cys 


Thr 


Glu 


Ala 


Ala 


Met 





55 60 65 70 



ctg taa aaaaccccaa gctacctacc tgaaaagctg gagagaatca gcacccatca 361 
Leu * 

gcttttccag cattttgggt cctcccagct aaatcategg cattgtgaag cagagatgag 421 

acctcctgcc atgccctttc caaatcctgg accaacagaa tcatcatata cactaggttt 481 

tgtgcatctc taagatgatc atcttggtgt ggggtcctga tcttcccagt gacttagcag 541 

cacagtaatt tggctgaaat gttgatgagt tcagacaaga agaaggtgat gatgacaaga 601 

gagegtcacg gggaaattct gecctgagga aggaatagtt tgaggctgga gagatgtgct 661 

acccccgcca ctctttctca aatggaaaga caacctgtag gcacagaaat cattgetctg 721 

cttacactga aatctcctgc tgactttgag agctgagcat tggattttga tgggagagtg 781 

aaacttagtg aagaaatgat ttgaaccttt gttgatctaa tacttctgtg atacagttat 841 

ctaaacagaa tgagtctaaa gtcagtataa agacctgtgt ctctccatgt tttgetgett 901 

ataattgeag aggaacattt tgagatagaa aattttatcc ctctttcaag tgacttttta 961 

taaaacataa ccttttaaga gaggtcattc ttttgaaaaa ttgtctgaat acaaaaaagt 1021 

aactgaacag caagggattg tagecataat gttcttgacc actttagatt gacaaatagt 1081 

atctcaaaat tgatcaggtt ctggagaatc tattgaggtt gtatggagta accaagtgat 1141 

ggaagaggtt ggaagtgaat aattttatgt ggtattatgt gatttttttt ggtgggggta ' 1201 

gtgtctggag ctgagcatga tggagectag atggggcagc ettctaatge tcccagtatt 1261 

tegcaaatea tatgtcacgt caaaattgta atgacaaagc tcacagcagc tgcctacgcc 1321 

aggtttggtg ccagctctaa ttagagcatt gatcatttct ctccccttca tgaatgeaca 1381 

tgagtgcaca ttttcatata taagggcaaa ggacatttca aagtttggat ggaatctttc 1441 

ctacttagaa gtaataaaaa acaatacttt gaaatcttac caaatgttaa acaaatagca 1501 

acaatttttc catttgaaga tgtgtgtctt aaggtctgtt ggatctttca aatttctatt 1561 

ttggacatgg aaactacaga tgaaatttaa cttactcaaa gttacacaga actcctggct 1621 

gatctctttt acttaattct tctaaaatct aaagtaagaa taattttgaa attttattgg 1681 

taatgcttaa ataacaaaag ttactttaat atgttacatg tttgaagttt catttatact 1741 

taatatctaa tttaacgett gtgaaaatat tattcagtat ttttttttct tttctaaaat 1801 

gacttgttca gggctccttt tgggatttta agtgagaaaa actctatttc ataatattgt 1861 

taccatgctc atccaataac acttcataga tggttagcca tctctaaaat taaattttgt 1921 

agttgaggag cattgaaaat atcacctata tttggatttc tgtttaacac ttcaaaatgc 1981 

attttatcat aatttaaaat ttgatttaaa tacaaacctc tttatagaaa acctgataaa 2041 

tgtagtttta tttgtttgct tgcttttttt tgcagggagg gatattccct tgataccact 2101 

ctgtctttat gaetcategg ttatttataa atgttatcat cgtttttctg atttttaaat 2161 

aagtttcaga attaatacat ttaactgaat aaatcaacca ataatgtcaa actt 2215 

<210> 5 

<211> 2132 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .212 
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<220> 

<221> 3'UTR 
<222> 213. .2132 

<220> 

<221> polyA_signal 
<222> 2106. .2111 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
«100> 5 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aca ttc acc ctg gga get ctg tgc tgc cat gca aat gtc aca ttg cac 209 
Thr Phe Thr Leu Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 

25 30 35 

tga ggccgccatg ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga 262 

gaatcagcac ccatcagctt ttccagcatt ttgggtcctc ccagctaaat categgcatt 322 

gtgaagcaga gatgagacct cctgccatgc cctttccaaa tcctggacca acagaatcat 382 

catatacact aggttttgtg catctctaag atgatcatct tggtgtgggg tcctgatctt 442 

cccagtgact tagcagcaca gtaatttggc tgaaatgttg atgagttcag acaagaagaa 502 

ggtgatgatg acaagagagc gtcacgggga aattctgccc tgaggaagga atagtttgag 562 

gctggagaga tgtgctaccc ccgccactct ttctcaaatg gaaagacaac ctgtaggcac 622 

agaaatcatt getctgetta cactgaaatc tcctgctgac tttgagagct gagcattgga 682 

ttttgatggg agagtgaaac ttagtgaaga aatgatttga acctttgttg atctaatact 742 

tctgtgatac agttatctaa acagaatgag tctaaagtca gtataaagac ctgtgtctct 802 

ccatgttttg ctgcttataa ttgcagagga acattttgag atagaaaatt ttatccctct 862 

ttcaagtgac tttttataaa acataacctt ttaagagagg tcattctttt gaaaaattgt 922 

ctgaatacaa aaaagtaact gaacagcaag ggattgtagc cataatgttc ttgaccactt 982 

tagattgaca aatagtatct caaaattgat caggttctgg agaatctatt gaggttgtat 1042 

ggagtaacca agtgatggaa gaggttggaa gtgaataatt ttatgtggta ttatgtgatt 1102 

ttttttggtg ggggtagtgt ctggagctga gcatgatgga gectagatgg ggcagccttc 1162 

taatgctccc agtatttege aaatcatatg teaegtcaaa attgtaatga caaagctcac 1222 

agcagctgcc tacgecaggt ttggtgccag ctctaattag agcattgatc atttctctcc 1282 

ccttcatgaa tgcacatgag tgcacatttt catatataag ggcaaaggac atttcaaagt 1342 

ttggatggaa tctttcctac ttagaagtaa taaaaaacaa tactttgaaa tcttaccaaa 1402 

tgttaaacaa atagcaacaa tttttccatt tgaagatgtg tgtcttaagg tctgttggat 1462 

ctttcaaatt tctattttgg acatggaaac tacagatgaa atttaactta ctcaaagtta 1522 

cacagaactc ctggctgatc tcttttactt aattcttcta aaatctaaag taagaataat 1582 

tttgaaattt tattggtaat gcttaaataa caaaagttac tttaatatgt tacatgtttg 1642 

aagtttcatt tatacttaat atctaattta acgcttgtga aaatattatt cagtattttt 1702 

ttttcttttc taaaatgact tgttcagggc tccttttggg attttaagtg agaaaaactc 1762 

tatttcataa tattgttacc atgctcatcc aataacactt catagatggt tagccatctc 1822 

taaaattaaa ttttgtagtt gaggagcatt gaaaatatca cctatatttg gatttctgtt 1882 

taacacttca aaatgcattt tatcataatt taaaatttga tttaaataca aacctcttta 1942 

tagaaaacct gataaatgta gttttatttg tttgettget tttttttgea gggagggata 2002 



127 



WO 01/40493 



PCT/IB00/01507 



ttcccttgat accactctgt ctttatgact catcggttat ttataaatgt tatcatcgtt 
tttctgattt ttaaataagt ttcagaatta atacatttaa ctgaataaat caaccaataa 
tgtcaaactt 

<210> 6 
<211> 1981 
<212> DNA 

<213> Homo sapiens 



2062 
2122 
2132 



<220> 

<221> 5'UTR 
<222> 1. . 95 



<220> 
<221> CDS 
<222> 96. .260 



<220> 

<221> 3 1 UTR 
<222> 261. .1981 



<220> 

<221> polyA_signal 
<222> 1955. .1960 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 6 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa ggt 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aac etc ate acc gga gtc aag tgt cat cac ctt tec cat gtt cca ttg 209 
Asn Leu He Thr Gly Val Lys Cys His His Leu Ser His Val Pro Leu 

25 30 35 

gtt aga agt ccc age tec tgc cca cat tea agg ggt ttt gtg cat etc 257 
Val Arg Ser Pro Ser Ser Cys Pro His Ser Arg Gly Phe Val His Leu 

40 45 50 

taa gatgatcatc ttggtgtggg gtcctgatct tcccagtgac ttagcagcac 310 

55 

agtaatttgg ctgaaatgtt gatgagttca gacaagaaga aggtgatgat gacaagagag 370 

cgtcacgggg aaattctgee ctgaggaagg aatagtttga ggctggagag atgtgctacc 430 

cccgccactc tttctcaaat ggaaagacaa cctgtaggca cagaaatcat tgetctgett 4 90 

acactgaaat ctcctgctga ctttgagagc tgagcattgg attttgatgg gagagtgaaa 550 

cttagtgaag aaatgatttg aacctttgtt gatctaatac ttctgtgata cagttatcta 610 

aacagaatga gtctaaagtc agtataaaga cctgtgtctc tccatgtttt getgettata 670 

attgeagagg aacattttga gatagaaaat tttatccctc tttcaagtga ctttttataa 730 

aacataacct tttaagagag gtcattcttt tgaaaaattg tctgaataca aaaaagtaac 790 

tgaacagcaa gggattgtag ccataatgtt cttgaccact ttagattgac aaatagtatc 850 
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tcaaaattga tcaggttctg gagaatctat tgaggttgta tggagtaacc aagtgatgga 910 

agaggttgga agtgaataat tttatgtggt attatgtgat tttttttggt gggggtagtg 970 

tctggagctg agcatgatgg agcctagatg gggcagcctt ctaatgctcc cagtatttcg 1030 

caaatcatat gtcacgtcaa aattgtaatg acaaagctca cagcagctgc ctacgccagg 1090 

tttggtgcca gctctaatta gagcattgat catttctctc cccttcatga atgcacatga 1150 

gtgcacattt tcatatataa gggcaaagga catttcaaag tttggatgga atctttccta 1210 

cttagaagta ataaaaaaca atactttgaa atcttaccaa atgttaaaca aatagcaaca 1270 

atttttccat ttgaagatgt gtgtcttaag gtctgttgga tctttcaaat ttctattttg 1330 

gacatggaaa ctacagatga aatttaactt actcaaagtt acacagaact cctggctgat 1390 

ctcttttact taattcttct aaaatctaaa gtaagaataa ttttgaaatt ttattggtaa 1450 

tgcttaaata acaaaagtta ctttaatatg ttacatgttt gaagtttcat ttatacttaa 1510 

tatctaattt aacgcttgtg aaaatattat tcagtatttt tttttctttt ctaaaatgac 1570 

ttgttcaggg ctccttttgg gattttaagt gagaaaaact ctatttcata atattgttac 1630 

catgctcatc caataacact tcatagatgg ttagccatct ctaaaattaa attttgtagt 1690 

tgaggagcat tgaaaatatc acctatattt ggatttctgt ttaacacttc aaaatgcatt 1750 

ttatcataat ttaaaatttg atttaaatac aaacctcttt atagaaaacc tgataaatgt 1810 

agttttattt gtttgcttgc ttttttttgc agggagggat attcccttga taccactctg 1870 

tctttatgac tcatcggtta tttataaatg ttatcatcgt ttttctgatt tttaaataag 1930 

tttcagaatt aatacattta actgaataaa tcaaccaata atgtcaaact t 1981 

<210> 7 

<211> 1992 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3'UTR 
<222> 168. .1992 

<220> 

<221> polyA_signal 
<222> 1966. . 1971 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 7 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 
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ggacacacta agtaatagag ttaatcacac cgtcctggtt ttgtgcatct ctaagatgat 
catcttggtg tggggtcctg atcttcccag tgacttagca gcacagtaat ttggctgaaa 
tgttgatgag ttcagacaag aagaaggtga tgatgacaag agagcgtcac ggggaaattc 
tgccctgagg aaggaatagt ttgaggctgg agagatgtgc tacccccgcc actctttctc 
aaatggaaag acaacctgta ggcacagaaa tcattgctct gcttacactg aaatctcctg 
ctgactttga gagctgagca ttggattttg atgggagagt gaaacttagt gaagaaatga 
tttgaacctt tgttgatcta atacttctgt gatacagtta tctaaacaga atgagtctaa 
agtcagtata aagacctgtg tctctccatg ttttgctgct tataattgca gaggaacatt 
ttgagataga aaattttatc cctctttcaa gtgacttttt ataaaacata accttttaag 
agaggtcatt cttttgaaaa attgtctgaa tacaaaaaag taactgaaca gcaagggatt 
gtagccataa tgttcttgac cactttagat tgacaaatag tatctcaaaa ttgatcaggt 
tctggagaat ctattgaggt tgtatggagt aaccaagtga tggaagaggt tggaagtgaa 
taattttatg tggtattatg tgattttttt tggtgggggt agtgtctgga gctgagcatg 
atggagccta gatggggcag ccttctaatg ctcccagtat ttcgcaaatc atatgtcacg 
tcaaaattgt aatgacaaag ctcacagcag ctgcctacgc caggtttggt gccagctcta 
attagagcat tgatcatttc tctccccttc atgaatgcac atgagtgcac attttcatat 
ataagggcaa aggacatttc aaagtttgga tggaatcttt cctacttaga agtaataaaa 
aacaatactt tgaaatctta ccaaatgtta aacaaatagc aacaattttt ccatttgaag 
atgtgtgtct taaggtctgt tggatctttc aaatttctat tttggacatg gaaactacag 
atgaaattta acttactcaa agttacacag aactcctggc tgatctcttt tacttaattc 
ttctaaaatc taaagtaaga ataattttga aattttattg gtaatgctta aataacaaaa 
gttactttaa tatgttacat gtttgaagtt tcatttatac ttaatatcta atttaacgct 
tgtgaaaata ttattcagta tttttttttc ttttctaaaa tgacttgttc agggctcctt 
ttgggatttt aagtgagaaa aactctattt cataatattg ttaccatgct catccaataa 
cacttcatag atggttagcc atctctaaaa ttaaattttg tagttgagga gcattgaaaa 
tatcacctat atttggattt ctgtttaaca cttcaaaatg cattttatca taatttaaaa 
tttgatttaa atacaaacct ctttatagaa aacctgataa atgtagtttt atttgtttgc 
ttgctttttt ttgcagggag ggatattccc ttgataccac tctgtcttta tgactcatcg 
gttatttata aatgttatca tcgtttttct gatttttaaa taagtttcag aattaataca 
tttaactgaa taaatcaacc aataatgtca aactt 

<210> 8 

<211> 1898 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .182 

<220> 

<221> 3'UTR 
<222> 183. .1898 

<220> 

<221> polyA_signal 
<222> 1872. .1877 

<220> 

<221> allele 

<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 

<222> 132 

<223> 8-126-286 : polymorphic base A or G 



277 
337 
397 
457 
517 
577 
637 
697 
757 
817 
877 
937 
997 
1057 
1117 
1177 
1237 
1297 
1357 
1417 
1477 
1537 
1597 
1657 
1717 
1777 
1837 
1897 
1957 
1992 
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<400> 8 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa ggg 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Glv 

10 15 20 

ttt tgt gca tct eta aga tga tcatcttggt gtggggtcct gatcttccca 212 
Phe Cys Ala Ser Leu Arg * 
25 

gtgacttagc agcacagtaa tttggctgaa atgttgatga gttcagacaa gaagaaggtg 272 

atgatgacaa gagagegtea eggggaaatt ctgccctgag gaaggaatag tttgaggctg 332 

gagagatgtg ctacccccgc cactctttct caaatggaaa gacaacctgt aggcacagaa 392 

ateattgetc tgcttacact gaaatctcct gctgactttg agagctgagc attggatttt 452 

gatgggagag tgaaacttag tgaagaaatg atttgaacct ttgttgatct aatacttctg 512 

tgatacagtt atctaaacag aatgagtcta aagtcagtat aaagacctgt gtctctccat 572 

gttttgctgc ttataattgc agaggaacat tttgagatag aaaattttat ccctctttca 632 

agtgactttt tataaaacat aaccttttaa gagaggtcat tcttttgaaa aattgtctga 692 

atacaaaaaa gtaactgaac agcaagggat tgtagecata atgttcttga ccactttaga 752 

ttgacaaata gtatctcaaa attgatcagg ttctggagaa tctattgagg ttgtatggag 812 

taaccaagtg atggaagagg ttggaagtga ataattttat gtggtattat gtgatttttt 872 

ttggtggggg tagtgtctgg agctgagcat gatggagect agatggggca gecttctaat 932 

gctcccagta tttcgcaaat catatgtcac gtcaaaattg taatgacaaa gctcacagca 992 

gctgcctacg ccaggtttgg tgccagctct aattagagca ttgatcattt ctctcccctt 1052 

catgaatgea catgagtgea cattttcata tataagggca aaggacattt caaagtttgg 1112 

atggaatctt tcctacttag aagtaataaa aaacaatact ttgaaatctt accaaatgtt 1172 

aaacaaatag caacaatttt tccatttgaa gatgtgtgtc ttaaggtctg ttggatcttt 1232 

caaatttcta ttttggacat ggaaactaca gatgaaattt aacttactca aagttacaca 1292 

gaactcctgg ctgatctctt ttacttaatt cttctaaaat ctaaagtaag aataattttg 1352 

aaattttatt ggtaatgett aaataacaaa agttacttta atatgttaca tgtttgaagt 1412 

ttcatttata cttaatatct aatttaaege ttgtgaaaat attattcagt attttttttt 1472 

cttttctaaa atgacttgtt cagggctcct tttgggattt taagtgagaa aaactctatt 1532 

tcataatatt gttaccatgc tcatccaata acacttcata gatggttagc catctctaaa 1592 

attaaatttt gtagttgagg agcattgaaa atatcaccta tatttggatt tctgtttaac 1652 

acttcaaaat gcattttatc ataatttaaa atttgattta aatacaaacc tctttataga 1712 

aaacctgata aatgtagttt tatttgtttg ettgettttt tttgcaggga gggatattcc 1772 

cttgatacca ctctgtcttt atgactcatc ggttatttat aaatgttatc ategttttte 1832 
tgatttttaa ataagtttca gaattaatac atttaactga ataaatcaac caataatgtc 



aaactt 

<210> 9 

<211> 1783 

<212> DNA 

<213> Homo sapiens 



1892 
1898 



<220> 

<221> 5'UTR 
<222> 1. . 95 



<220> 
<221> CDS 
<222> 96. .167 



<220> 

<221> 3'UTR 
<222> 168. . 1783 

<220> 

<221> polyA_signal 
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<222> 1757. .1762 
<220> 

<221> allele 
<222> 132 
<223> 8-130-143 

<220> 

<221> allele 
<222> 132 
<223> 8-126-286 



polymorphic base A or G 



polymorphic base A or G 



<400> 9 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct * 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctgaaa gacaacctgt aggcacagaa 277 

ateattgetc tgcttacact gaaatctcct gctgactttg agagctgagc attggatttt 337 

gatgggagag tgaaacttag tgaagaaatg atttgaacct ttgttgatct aatacttctg 397 

tgatacagtt atctaaacag aatgagtcta aagtcagtat aaagacctgt gtctctccat 457 

gttttgctgc ttataattgc agaggaacat tttgagatag aaaattttat ccctctttca 517 

agtgactttt tataaaacat aaccttttaa gagaggtcat tcttttgaaa aattgtctga 577 

atacaaaaaa gtaactgaac agcaagggat tgtagecata atgttcttga ccactttaga 637 

ttgacaaata gtatctcaaa attgatcagg ttctggagaa tctattgagg ttgtatggag 697 

taaccaagtg atggaagagg ttggaagtga ataattttat gtggtattat gtgatttttt 757 

ttggtggggg tagtgtctgg agctgagcat gatggagect agatggggca gecttctaat 817 

gctcccagta tttcgcaaat catatgtcac gtcaaaattg taatgacaaa gctcacagca 877 

gctgcctacg ccaggtttgg tgccagctct aattagagca ttgatcattt ctctcccctt 937 

catgaatgea catgagtgea cattttcata tataagggca aaggacattt caaagtttgg 997 

atggaatctt tcctacttag aagtaataaa aaacaatact ttgaaatctt accaaatgtt 1057 

aaacaaatag caacaatttt tccatttgaa gatgtgtgtc ttaaggtctg ttggatcttt 1117 

caaatttcta ttttggacat ggaaactaca gatgaaattt aacttactca aagttacaca 1177 

gaactcctgg ctgatctctt ttacttaatt cttctaaaat ctaaagtaag aataattttg 1237 

aaattttatt ggtaatgett aaataacaaa agttacttta atatgttaca tgtttgaagt 1297 

ttcatttata cttaatatct aatttaaege ttgtgaaaat attattcagt attttttttt 1357 

cttttctaaa atgacttgtt cagggctcct tttgggattt taagtgagaa aaactctatt 1417 

tcataatatt gttaccatgc tcatccaata acacttcata gatggttagc catctctaaa 1477 

attaaatttt gtagttgagg agcattgaaa atatcaccta tatttggatt tctgtttaac 1537 

acttcaaaat gcattttatc ataatttaaa atttgattta aatacaaacc tctttataga 1597 

aaacctgata aatgtagttt tatttgtttg ettgettttt tttgcaggga gggatattcc 1657 

cttgatacca ctctgtcttt atgactcatc ggttatttat aaatgttatc ategttttte 1717 

tgatttttaa ataagtttca gaattaatac atttaactga ataaatcaac caataatgtc 1777 

aaactt 1783 

<210> 10 

<211> 2309 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 



<220> 
<221> CDS 



132 
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<222> 96. .311 
<220> 

<221> 3*UTR 
<222> 312. .2309 

<220> 

<221> polyA_signal 
<222> 2283. .2288 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 10 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 

1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa ggt 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aac etc ate acc gga gtc aag tgt cat cac ctt tec cat gtt cca ttg 209 
Asn Leu He Thr Gly Val Lys Cys His His Leu Ser His Val Pro Leu 

25 30 35 

gtt aga agt ccc age tec tgc cca cat tea agg gaa aat aac gca ttt . 257 
Val Arg Ser Pro Ser Ser Cys Pro His Ser Arg Glu Asn Asn Ala Phe 

40 45 50 

ata aaa gec age tea gat gtt act tec aac caa caa att tec tgg aca 305 
He Lys Ala Ser Ser Asp Val Thr Ser Asn Gin Gin He Ser Trp Thr 
55 60 65 70 

cac taa gtaatagagt taatcacacc gtcctgaaca ttcaccctgg gagctctgtg 361 

ctgccatgca aatgtcacat tgcactgagg ccgccatgct gtaaaaaacc ccaagctacc 421 

tacctgaaaa gctggagaga atcagcaccc atcagctttt ccagcatttt gggtcctccc 481 

agctaaatca teggcattgt gaagcagaga tgagacctcc tgccatgccc tttccaaatc 541 

ctggaccaac agaatcatca tatacactag gttttgtgca tctctaagat gatcatcttg 601 

gtgtggggtc ctgatcttcc cagtgactta gcagcacagt aatttggctg aaatgttgat 661 

gagttcagac aagaagaagg tgatgatgac aagagagcgt caeggggaaa ttctgccctg 721 

aggaaggaat agtttgaggc tggagagatg tgctaccccc gccactcttt ctcaaatgga 781 

aagacaacct gtaggcacag aaatcattgc tetgettaca ctgaaatctc ctgetgaett 841 

tgagagctga gcattggatt ttgatgggag agtgaaactt agtgaagaaa tgatttgaac 901 

ctttgttgat ctaatacttc tgtgatacag ttatctaaac agaatgagtc taaagtcagt 961 

ataaagacct gtgtctctcc atgttttgct gcttataatt gcagaggaac attttgagat 1021 

agaaaatttt atccctcttt caagtgactt tttataaaac ataacctttt aagagaggtc 1081 

attcttttga aaaattgtct gaatacaaaa aagtaactga acagcaaggg attgtageca 1141 

taatgttctt gaccacttta gattgacaaa tagtatctca aaattgatca ggttctggag 1201 

aatctattga ggttgtatgg agtaaccaag tgatggaaga ggttggaagt gaataatttt 1261 

atgtggtatt atgtgatttt ttttggtggg ggtagtgtct ggagctgagc atgatggagc 1321 

ctagatgggg cagccttcta atgctcccag tatttegcaa atcatatgtc aegtcaaaat 1381 

tgtaatgaca aagctcacag cagctgccta cgccaggttt ggtgccagct ctaattagag 1441 

cattgatcat ttctctcccc ttcatgaatg cacatgagtg cacattttca tatataaggg 1501 

caaaggacat ttcaaagttt ggatggaatc tttcctactt agaagtaata aaaaacaata 1561 

ctttgaaatc ttaccaaatg ttaaacaaat agcaacaatt tttccatttg aagatgtgtg 1621 

tcttaaggtc tgttggatct ttcaaatttc tattttggac atggaaacta cagatgaaat 1681 
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ttaacttact caaagttaca cagaactcct ggctgatctc ttttacttaa ttcttctaaa 1741 

atctaaagta agaataattt tgaaatttta ttggtaatgc ttaaataaca aaagttactt 1801 

taatatgtta catgtttgaa gtttcattta tacttaatat ctaatttaac gcttgtgaaa 1861 

atattattca gtattttttt ttcttttcta aaatgacttg ttcagggctc cttttgggat 1921 

tttaagtgag aaaaactcta tttcataata ttgttaccat gctcatccaa taacacttca 1981 

tagatggtta gccatctcta aaattaaatt ttgtagttga ggagcattga aaatatcacc 2041 

tatatttgga tttctgttta acacttcaaa atgcatttta tcataattta aaatttgatt 2101 

taaatacaaa cctctttata gaaaacctga taaatgtagt tttatttgtt tgcttgcttt 2161 

tttttgcagg gagggatatt cccttgatac cactctgtct ttatgactca tcggttattt 2221 

ataaatgtta tcatcgtttt tctgattttt aaataagttt cagaattaat acatttaact 2281 

gaataaatca accaataatg tcaaactt 2309 

<210> 11 

<211> 2075 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'CJTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .311 

<220> 

<221> 3*UTR 
<222> 312. .2075 

<220> 

<221> polyA_signal 
<222> 2049. .2054 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 11 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 



aag 


cct 


gta 


agg 


aga 


tec 


rgt 


etc 


agt 


tgg 


cca 


ggt 


ctt 


gtg 


aaa 


ggt 


161 


Lys 


Pro 


Val 


Arg 


Arg 


Ser 


Xaa 


Leu 


Ser 


Trp 


Pro 


Gly 


Leu 


Val 


Lys 


Gly 










10 










15 










20 








aac 


etc 


ate 


acc 


gga 


gtc 


aag 


tgt 


cat 


cac 


ctt 


tec 


cat 


gtt 


cca 


ttg 


209 


Asn 


Leu 


lie 


Thr 


Gly 


Val 


Lys 


Cys 


His 


His 


Leu 


Ser 


His 


Val 


Pro 


Leu 








25 










30 










35 










gtt 


aga 


agt 


ccc 


age 


tec 


tgc 


cca 


cat 


tea 


agg 


gaa 


aat 


aac 


gca 


ttt 


257 


Val 


Arg 


Ser 


Pro 


Ser 


Ser 


Cys 


Pro 


His 


Ser 


Arg 


Glu 


Asn 


Asn 


Ala 


Phe 






40 










45 










50 












ata 


aaa 


gee 


age 


tea 


gat 


gtt 


act 


tec 


aac 


caa 


caa 


att 


tec 


tgg 


aca 


305 


He 


Lys 


Ala 


Ser 


Ser 


Asp 


Val 


Thr 


Ser 


Asn 


Gin 


Gin 


He 


Ser 


Trp 


Thr 




55 










60 










65 








70 




cac 


taa 


gtaatagagt taatcacacc gtcctggttt 


tgtgcatctc 


taagatgatc 


361 
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His * 

atcttggtgt ggggtcctga tcttcccagt gacttagcag cacagtaatt tggctgaaat 421 

gttgatgagt tcagacaaga agaaggtgat gatgacaaga gagcgtcacg gggaaattct 481 

gccctgagga aggaatagtt tgaggctgga gagatgtgct acccccgcca ctctttctca 541 

aatggaaaga caacctgtag gcacagaaat cattgctctg cttacactga aatctcctgc 601 

tgactttgag agctgagcat tggattttga tgggagagtg aaacttagtg aagaaatgat 661 

ttgaaccttt gttgatctaa tacttctgtg atacagttat ctaaacagaa tgagtctaaa 721 

gtcagtataa agacctgtgt ctctccatgt tttgctgctt ataattgcag aggaacattt 781 

tgagatagaa aattttatcc ctctttcaag tgacttttta taaaacataa ccttttaaga 841 

gaggtcattc ttttgaaaaa ttgtctgaat acaaaaaagt aactgaacag caagggattg 901 

tagccataat gttcttgacc actttagatt gacaaatagt atctcaaaat tgatcaggtt 961 

ctggagaatc tattgaggtt gtatggagta accaagtgat ggaagaggtt ggaagtgaat 1021 

aattttatgt ggtattatgt gatttttttt ggtgggggta gtgtctggag ctgagcatga 1081 

tggagcctag atggggcagc cttctaatgc tcccagtatt tcgcaaatca tatgtcacgt 1141 

caaaattgta atgacaaagc tcacagcagc tgcctacgcc aggtttggtg ccagctctaa 1201 

ttagagcatt gatcatttct ctccccttca tgaatgcaca tgagtgcaca ttttcatata 1261 

taagggcaaa ggacatttca aagtttggat ggaatctttc ctacttagaa gtaataaaaa 1321 

acaatacttt gaaatcttac caaatgttaa acaaatagca acaatttttc catttgaaga 1381 

tgtgtgtctt aaggtctgtt ggatctttca aatttctatt ttggacatgg aaactacaga 1441 

tgaaatttaa cttactcaaa gttacacaga actcctggct gatctctttt acttaattct 1501 

tctaaaatct aaagtaagaa taattttgaa attttattgg taatgcttaa ataacaaaag 1561 

ttactttaat atgttacatg tttgaagttt catttatact taatatctaa tttaacgctt 1621 

gtgaaaatat tattcagtat ttttttttct tttctaaaat gacttgttca gggctccttt 1681 

tgggatttta agtgagaaaa actctatttc ataatattgt taccatgctc atccaataac 1741 

acttcataga tggttagcca tctctaaaat taaattttgt agttgaggag cattgaaaat 1801 

atcacctata tttggatttc tgtttaacac ttcaaaatgc attttatcat aatttaaaat 1861 

ttgatttaaa tacaaacctc tttatagaaa acctgataaa tgtagtttta tttgtttgct 1921 

tgcttttttt tgcagggagg gatattccct tgataccact ctgtctttat gactcatcgg 1981 

ttatttataa atgttatcat cgtttttctg atttttaaat aagtttcaga attaatacat 2041 

ttaactgaat aaatcaacca ataatgtcaa actt 2075 

<210> 12 

<211> 1923 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .95 



<220> 
<221> CDS 
<222> 96. .212 



<220> 

<221> 3'UTR 
<222> 213. .1923 



<220> 

<221> polyA_signal 
<222> 1897. .1902 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
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<400> 12 
ttgcaggtcc acat 
aactcatagt ttct 



aag cct gta agg 
Lys Pro Val Arg 
10 

aca ttc acc ctg 
Thr Phe Thr Leu 
25 

tga ggccgccatg 



tcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 
tgccat gaagactttt ccaac atg get get tac ttc ttc 

Met Ala Ala Tyr Phe Phe 
1 5 
aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 
Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

15 20 
gga get ctg tgc tgc cat gca aat gtc aca ttg cac 
Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 
30 35 
ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga 



gaatcagcac 
gtgaagcaga 
catatacact 
ctcctgctga 
aaatgatttg 
gtctaaagtc 
aacattttga 
tttaagagag 
gggattgtag 
tcaggttctg 
agtgaataat 
agcatgatgg 
gtcacgtcaa 
gctctaatta 
tcatatataa 
ataaaaaaca 
ttgaagatgt 
ctacagatga 
taattcttct 
acaaaagtta 
aacgcttgtg 
ctccttttgg 
caataacact 
tgaaaatatc 
ttaaaatttg 
gtttgcttgc 
teateggtta 
aatacattta 



ccatcagctt 
gatgagacct 
agaaagacaa 
ctttgagagc 
aacctttgtt 
agtataaaga 
gatagaaaat 
gtcattcttt 
ccataatgtt 
gagaatctat 
tttatgtggt 
agectagatg 
aattgtaatg 
gagcattgat 
gggcaaagga 
atactttgaa 
gtgtcttaag 
aatttaactt 
aaaatctaaa 
ctttaatatg 
aaaatattat 
gattttaagt 
tcatagatgg 
acctatattt 
atttaaatac 
ttttttttgc 
tttataaatg 
actgaataaa 



ttccagcatt 
cctgccatgc 
cctgtaggca 
tgagcattgg 
gatctaatac 
cctgtgtctc 
tttatccctc 
tgaaaaattg 
cttgaccact 
tgaggttgta 
attatgtgat 
gggcagcett 
acaaagctca 
catttctctc 
catttcaaag 
atcttaccaa 
gtctgttgga 
actcaaagtt 
gtaagaataa 
ttacatgttt 
tcagtatttt 
gagaaaaact 
ttagecatet 
ggatttctgt 
aaacctcttt 
agggagggat 
ttatcatcgt 
tcaaccaata 



ttgggtcctc 
cctttccaaa 
cagaaatcat 
attttgatgg 
ttctgtgata 
tccatgtttt 
tttcaagtga 
tctgaataca 
ttagattgac 
tggagtaacc 
tttttttggt 
ctaatgctcc 
cagcagctgc 
cccttcatga 
tttggatgga 
atgttaaaca 
tctttcaaat 
acacagaact 
ttttgaaatt 
gaagtttcat 
tttttctttt 
ctatttcata 
ctaaaattaa 
ttaacacttc 
atagaaaacc 
attcccttga 
ttttctgatt 
atgtcaaact 



ccagctaaat 
tcctggacca 
tgetctgett 
gagagtgaaa 
cagttatcta 
getgettata 
ctttttataa 
aaaaagtaac 
aaatagtatc 
aagtgatgga 
gggggtagtg 
cagtatttcg 
ctacgccagg 
atgcacatga 
atctttccta 
aatagcaaca 
ttctattttg 
cctggctgat 
ttattggtaa 
ttatacttaa 
ctaaaatgac 
atattgttac 
attttgtagt 
aaaatgeatt 
tgataaatgt 
taccactctg 
tttaaataag 
t 



categgcatt 
acagaatcat 
acactgaaat 
cttagtgaag 
aacagaatga 
attgeagagg 
aacataacct 
tgaacagcaa 
tcaaaattga 
agaggttgga 
tctggagctg 
caaatcatat 
tttggtgcca 
gtgcacattt 
cttagaagta 
atttttccat 
gacatggaaa 
ctcttttact 
tgcttaaata 
tatctaattt 
ttgttcaggg 
catgctcatc 
tgaggagcat 
ttatcataat 
agttttattt 
tctttatgac 
tttcagaatt 



60 
113 



161 



209 



262 

322 
382 
442 
502 
562 
622 
682 
742 
802 
862 
922 
982 
1042 
1102 
1162 
1222 
1282 
1342 
1402 
1462 
1522 
1582 
1642 
1702 
1762 
1822 
1882 
1923 



<210> 13 

<211> 2308 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1. .95 



<220> 
<221> CDS 
<222> 96. .167 



<220> 

<221> 3'UTR 
<222> 168. .2308 



<220> 
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<221> polyA_signal 
<222> 2282. .2287 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 13 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagq 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Glv 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 

Lys + x 

ggacacacta agtaatagag ttaatcacac cgtcctaaac aatacgaaca gaaacaaagc 277 

tatgtacttg ttttgggaag caacctcaaa tggaaagatg ggggctgtgt tctaaaagga 337 

acattcaccc tgggagctct gtgetgecat gcaaatgtca cattgeactg aggccgccat 397 

gctgtaaaaa accccaagct acctacctga aaagctggag agaatcagca cccatcagct 457 

tttccagcat tttgggtcct cccagctaaa teateggcat tgtgaagcag agatgagacc 517 

tcctgccatg ccctttccaa atcctggacc aacagaatca tcatatacac taggttttgt 577 

gcatctctaa gatgatcatc ttggtgtggg gtcctgatct tcccagtgac ttagcagcac 637 

agtaatttgg ctgaaatgtt gatgagttca gacaagaaga aggtgatgat gacaagagag 697 

cgtcacgggg aaattctgee ctgaggaagg aatagtttga ggctggagag atgtgctacc 757 

cccgccactc tttctcaaat ggaaagacaa cctgtaggca cagaaatcat tgetctgett 817 

acactgaaat ctcctgctga ctttgagagc tgagcattgg attttgatgg gagagtgaaa 877 

cttagtgaag aaatgatttg aacctttgtt gatctaatac ttctgtgata cagttatcta 937 

aacagaatga gtctaaagtc agtataaaga cctgtgtctc tccatgtttt getgettata 997 

attgeagagg aacattttga gatagaaaat tttatccctc tttcaagtga ctttttataa 1057 

aacataacct tttaagagag gtcattcttt tgaaaaattg tctgaataca aaaaagtaac 1117 

tgaacagcaa gggattgtag ccataatgtt cttgaccact ttagattgac aaatagtatc 1177 

tcaaaattga tcaggttctg gagaatctat tgaggttgta tggagtaacc aagtgatgga 1237 

agaggttgga agtgaataat tttatgtggt attatgtgat tttttttggt gggggtagtg 1297 

tctggagctg agcatgatgg agectagatg gggcagcett ctaatgctcc cagtatttcg 1357 

caaatcatat gtcacgtcaa aattgtaatg acaaagctca cagcagctgc ctacgccagg 1417 

tttggtgcca gctctaatta gagcattgat catttctctc cccttcatga atgcacatga 1477 

gtgcacattt tcatatataa gggcaaagga catttcaaag tttggatgga atctttccta 1537 

cttagaagta ataaaaaaca atactttgaa atcttaccaa atgttaaaca aatagcaaca 1597 

atttttccat ttgaagatgt gtgtcttaag gtctgttgga tctttcaaat ttctattttg 1657 

gacatggaaa ctacagatga aatttaactt actcaaagtt acacagaact cctggctgat 1717 

ctcttttact taattcttct aaaatctaaa gtaagaataa ttttgaaatt ttattggtaa 1777 

tgcttaaata acaaaagtta ctttaatatg ttacatgttt gaagtttcat ttatacttaa 1837 

tatctaattt aacgcttgtg aaaatattat tcagtatttt tttttctttt ctaaaatgac 1897 

ttgttcaggg ctccttttgg gattttaagt gagaaaaact ctatttcata atattgttac 1957 

catgctcatc caataacact tcatagatgg ttagecatet ctaaaattaa attttgtagt 2017 

tgaggagcat tgaaaatatc acctatattt ggatttctgt ttaacacttc aaaatgeatt 2077 

ttatcataat ttaaaatttg atttaaatac aaacctcttt atagaaaacc tgataaatgt 2137 

agttttattt gtttgcttgc ttttttttgc agggagggat attcccttga taccactctg 2197 

tctttatgac teateggtta tttataaatg ttatcatcgt ttttctgatt tttaaataag 2257 

tttcagaatt aatacattta actgaataaa tcaaccaata atgtcaaact t 2308 

<210> 14 
<211> 2706 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5*UTR 
<222> 1. . 95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3'UTR 
<222> 168 . .2706 

<220> 

<221> polyA_signal 
<222> 2680. .2685 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 14 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctgaac attcaccctg ggagctctgt 277 
gctgccatgc aaatgtcaca ttgeactgag gccgccatgc tgtaaaaaac cccaagctac 337 
ctacctgaaa agctggagag aatcagcacc catcagcttt tccagcattt tgggtcctcc 397 
cagctaaatc ateggcattg tgaagcagag atgagacctc ctgccatgcc ctttccaaat 457 
cctggaccaa cagaatcatc atatacacta gtaagctttt actttaagee actaatttgg 517 
ggtgtagctc ttttgeagea aaagatcatc agaacactgc catgeaggag ctgataaact 577 
tgctcactgt tatgtatgta tccaatgact actcacacac acattacttg cttcactcta 637 
ctttctggct ttccattcag caaattttaa gaaatgaaaa tgttattcgt gggaaattaa 697 
ctcaagatca atattgttgt agttaatctc tttgtaaatg gtaaaatatt aaattattgt 757 
gagtcatttt tatttaaaga agtttggttc tttaaatgaa agaagtttgg gtcttttgtg 817 
gatactctgc atatagacat tctagacatg aaagaatgtt gaagacaaag aatgttgaag 877 
aaatatagac atgaaagact gttgaagttt ggttttttgt ggatactctg catatagaca 937 
ttctagactg gagtgataca tcttgttcag ggttttgtgc atctctaaga tgatcatctt 997 
ggtgtggggt cctgatcttc ccagtgactt agcagcacag taatttggct gaaatgttga 1057 
tgagttcaga caagaagaag gtgatgatga caagagagcg teaeggggaa attctgccct 1117 
gaggaaggaa tagtttgagg ctggagagat gtgctacccc cgccactctt tctcaaatgg 1177 
aaagacaacc tgtaggcaca gaaatcattg ctctgcttac actgaaatct cctgctgact 1237 
ttgagagctg agcattggat tttgatggga gagtgaaact tagtgaagaa atgatttgaa 1297 
cctttgttga tctaatactt ctgtgataca gttatctaaa cagaatgagt ctaaagtcag 1357 
tataaagacc tgtgtctctc catgttttgc tgcttataat tgcagaggaa cattttgaga 1417 
tagaaaattt tatccctctt tcaagtgact ttttataaaa cataaccttt taagagaggt 1477 
cattcttttg aaaaattgtc tgaatacaaa aaagtaactg aacagcaagg gattgtagee 1537 
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ataatgttct tgaccacttt agattgacaa atagtatctc aaaattgatc aggttctgga 1597 

gaatctattg aggttgtatg gagtaaccaa gtgatggaag .aggttggaag tgaataattt 1657 

tatgtggtat tatgtgattt tttttggtgg gggtagtgtc tggagctgag catgatggag 1717 

cctagatggg gcagccttct aatgctccca gtatttcgca aatcatatgt cacgtcaaaa 1777 

ttgtaatgac aaagctcaca gcagctgcct acgccaggtt tggtgccagc tctaattaga 1837 

gcattgatca tttctctccc cttcatgaat gcacatgagt gcacattttc atatataagg 1897 

gcaaaggaca tttcaaagtt tggatggaat ctttcctact tagaagtaat aaaaaacaat 1957 

actttgaaat cttaccaaat gttaaacaaa tagcaacaat ttttccattt gaagatgtgt 2017 

gtcttaaggt ctgttggatc tttcaaattt ctattttgga catggaaact acagatgaaa 2077 

tttaacttac tcaaagttac acagaactcc tggctgatct cttttactta attcttctaa 2137 

aatctaaagt aagaataatt ttgaaatttt attggtaatg cttaaataac aaaagttact 2197 

ttaatatgtt acatgtttga agtttcattt atacttaata tctaatttaa cgcttgtgaa 2257 

aatattattc agtatttttt tttcttttct aaaatgactt gttcagggct ccttttggga 2317 

ttttaagtga gaaaaactct atttcataat attgttacca tgctcatcca ataacacttc 2377 

atagatggtt agccatctct aaaattaaat tttgtagttg aggagcattg aaaatatcac 2437 

ctatatttgg atttctgttt aacacttcaa aatgcatttt atcataattt aaaatttgat 2497 

ttaaatacaa acctctttat agaaaacctg ataaatgtag ttttatttgt ttgcttgctt 2557 

ttttttgcag ggagggatat tcccttgata ccactctgtc tttatgactc atcggttatt 2617 

tataaatgtt atcatcgttt ttctgatttt taaataagtt tcagaattaa tacatttaac 2677 

tgaataaatc aaccaataat gtcaaactt 2706 

<210> 15 

<211> 2612 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .212 

<220> 

<221> 3'UTR 
<222> 213. .2612 

<220> 

<221> polyA_signal 
<222> 2586. .2591 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 15 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aca ttc acc ctg gga get ctg tgc tgc cat gca aat gtc aca ttg cac 209 
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Thr Phe Thr Leu Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 

25 30 35 

tga ggccgccatg ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga 262 

gaatcagcac ccatcagctt ttccagcatt ttgggtcctc ccagctaaat catcggcatt 322 

gtgaagcaga gatgagacct cctgccatgc cctttccaaa tcctggacca acagaatcat 382 

catatacact agtaagcttt tactttaagc cactaatttg gggtgtagct cttttgcagc 442 

aaaagatcat cagaacactg ccatgcagga gctgataaac ttgctcactg ttatgtatgt 502 

atccaatgac tactcacaca cacattactt gcttcactct actttctggc tttccattca 562 

gcaaatttta agaaatgaaa atgttattcg tgggaaatta actcaagatc aatattgttg 622 

tagttaatct ctttgtaaat ggtaaaatat taaattattg tgagtcattt ttatttaaag 682 

aagtttggtt ctttaaatga aagaagtttg ggtcttttgt ggatactctg catatagaca 742 

ttctagacat gaaagaatgt tgaagacaaa gaatgttgaa gaaatataga catgaaagac 802 

tgttgaagtt tggttttttg tggatactct gcatatagac attctagact ggagtgatac 862 

atcttgttca gggttttgtg catctctaag atgatcatct tggtgtgggg tcctgatctt 922 

cccagtgact tagcagcaca gtaatttggc tgaaatgttg atgagttcag acaagaagaa 982 

ggtgatgatg acaagagagc gtcacgggga aattctgccc tgaggaagga atagtttgag 1042 

gctggagaga tgtgctaccc ccgccactct ttctcaaatg gaaagacaac ctgtaggcac 1102 

agaaatcatt gctctgctta cactgaaatc tcctgctgac tttgagagct gagcattgga 1162 

ttttgatggg agagtgaaac ttagtgaaga aatgatttga acctttgttg atctaatact 1222 

tctgtgatac agttatctaa acagaatgag tctaaagtca gtataaagac ctgtgtctct 1282 

ccatgttttg ctgcttataa ttgcagagga acattttgag atagaaaatt ttatccctct 1342 

ttcaagtgac tttttataaa acataacctt ttaagagagg tcattctttt gaaaaattgt 1402 

ctgaatacaa aaaagtaact gaacagcaag ggattgtagc cataatgttc ttgaccactt 1462 

tagattgaca aatagtatct caaaattgat caggttctgg agaatctatt gaggttgtat 1522 

ggagtaacca agtgatggaa gaggttggaa gtgaataatt ttatgtggta ttatgtgatt 1582 

ttttttggtg ggggtagtgt ctggagctga gcatgatgga gcctagatgg ggcagccttc 1642 

taatgctccc agtatttcgc aaatcatatg tcacgtcaaa attgtaatga caaagctcac 1702 

agcagctgcc tacgccaggt ttggtgccag ctctaattag agcattgatc atttctctcc 1762 

ccttcatgaa tgcacatgag tgcacatttt catatataag ggcaaaggac atttcaaagt 1822 

ttggatggaa tctttcctac ttagaagtaa taaaaaacaa tactttgaaa tcttaccaaa 1882 

tgttaaacaa atagcaacaa tttttccatt tgaagatgtg tgtcttaagg tctgttggat 1942 

ctttcaaatt tctattttgg acatggaaac tacagatgaa atttaactta ctcaaagtta 2002 

cacagaactc ctggctgatc tcttttactt aattcttcta aaatctaaag taagaataat 2062 

tttgaaattt tattggtaat gcttaaataa caaaagttac tttaatatgt tacatgtttg 2122 

aagtttcatt tatacttaat atctaattta acgcttgtga aaatattatt cagtattttt 2182 

ttttcttttc taaaatgact tgttcagggc tccttttggg attttaagtg agaaaaactc 2242 

tatttcataa tattgttacc atgctcatcc aataacactt catagatggt tagccatctc 2302 

taaaattaaa ttttgtagtt gaggagcatt gaaaatatca cctatattrg gatttctgtt 2362 

taacacttca aaatgcattt tatcataatt taaaatttga tttaaataca aacctcttta 2422 

tagaaaacct gataaatgta gttttatttg tttgcttgct tttttttgca gggagggata 2482 

ttcccttgat accactctgt ctttatgact catcggttat ttataaatgt tatcatcgtt 2542 

tttctgattt ttaaataagt ttcagaatta atacatttaa ctgaataaat caaccaataa 2602 

tgtcaaactt 2612 

<210> 16 

<211> 434 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .370 

<220> 
<221> CDS 
<222> 371. .436 

<220> 

<221> 3'UTR 
<222> 437. .434 
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<220> 

<221> allele 
<222> 407 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 407 

<223> 8-126-286 : polymorphic base A or G 
<400> 16 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctcg cttctaattc cttttcagaa 120 
aacaaaaact cctaagagaa tgaatagaaa ttttcttaag gaattataaa gaaccatctc 180 
agaattttac tctgttactg caaaatctca tgcaaagata accaagtacg aagaattaac 240 
tgaaattcat cccaagatga ggcatcttgg ctgagttgca ggtccacatt cgttctagag 300 
gctgcctgcg gtccttagag gcctcatagg caaggaactc atagtttctt gccatgaaga 360 
cttttccaac atg get get tac ttc ttc aag cct gta agg aga tec rgt 409 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa 

15 10 
etc agt tgg cca ggt ctt gtg aaa g 434 
Leu Ser Trp Pro Gly Leu Val Lys 
15 20 

<210> 17 

<211> 760 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1..696 

<220> 
<221> CDS 
<222> 697. .762 

<220> 

<221> 3'UTR 
<222> 763. .760 

<220> 

<221> allele 
<222> 733 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 733 

<223> 8-126-286 : polymorphic base A or G 
<400> 17 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctcg cttctaattc ctgtacgtca 120 
ctttaccttt tttgtctata aatttattct gaccacaagg cacccctgga gcctctgtga 180 
gtctgctgtg gttctgggga ctgeeggatt cgcgaatcgt tcatttctca gttaaactcc 240 
tttaaattaa ttcagctgaa gtttttcttt tatcaatgaa aagactagac tggcttagee 300 
ttccagcctg cacctttctc ccatgctgga tgcttcctgc ccttgaatat cagactccaa 360 
gttcttcagt tttgggactc agactggctt ccttgctcct cagcttgeag atgecctatt 420 
gtggggtctt gtgttcgttt tcagaaaaca aaaactccta agagaatgaa tagaaatttt 480 
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cttaaggaat tataaagaac catctcagaa ttttactctg ttactgcaaa atctcatgca 
aagataacca agtacgaaga attaactgaa attcatccca agatgaggca tcttggctga 
gttgcaggtc cacattcgtt ctagaggctg cctgcggtcc ttagaggcct cataggcaag 00 u 
gaactcatag tttcttgcca tgaagacttt tccaac atg get get tac ttc ttc 714 

Met Ala Ala Tyr Phe Phe 
1 5 
aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys 
!0 15 20 

<210> 18 
<211> 23 
<212> PRT 

<213> Homo sapiens 



540 
600 
660 



g 760 



<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 



<400> 18 

Met Ala Ala Tyr Phe Phe Lys Pro 
1 5 
Pro Gly Leu Val Lys Gly Lys 
20 



Val Arg Arg Ser Xaa Leu Ser Trp 
10 15 



<210> 19 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 
<400> 19 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 

15 10 15 

Pro Gly Leu Val Lys Gly Asn Leu lie Thr Gly Val Lys Cys His His 

20 25 30 

Leu Ser His Val Pro Leu Val Arg Ser Pro Ser Ser Cys Pro His Ser 

35 40 45 

Arg Glu His Ser Pro Trp Glu Leu Cys Ala Ala Met Gin Met Ser His 

50 55 60 

Cys Thr Glu Ala Ala Met Leu 
65 70 



<210> 20 
<211> 38 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 



<400> 20 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 
15 10 15 
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Pro Gly Leu Val Lys Gly Thr Phe Thr Leu Gly Ala Leu Cys Cys His 

20 25 30 

Ala Asn Val Thr Leu His 
35 



<210> 21 
<211> 54 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 



<400> 21 

Met Ala Ala Tyr Phe Phe Lys Pro 
1 5 
Pro Gly Leu Val Lys Gly Asn Leu 
20 

Leu Ser His Val Pro Leu Val Arg 

35 40 
Arg Gly Phe Val His Leu 
50 



Val Arg Arg Ser Xaa Leu Ser Trp 

10 15 
He Thr Gly Val Lys Cys His His 
25 30 
Ser Pro Ser Ser Cys Pro His Ser 
45 



<210> 22 
<211> 28 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 



<400> 22 

Met Ala Ala Tyr Phe Phe Lys Pro 
1 5 
Pro Gly Leu Val Lys Gly Phe Cys 
20 



Val Arg Arg Ser Xaa Leu Ser Trp 

10 15 
Ala Ser Leu Arg 
25 



<210> 23 
<211> 71 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=Gly or Ser 



<400> 23 

Met Ala Ala Tyr Phe Phe Lys Pro 
1 5 
Pro Gly Leu Val Lys Gly Asn Leu 
20 

Leu Ser His Val Pro Leu Val Arg 

35 40 
Arg Glu Asn Asn Ala Phe He Lys 

50 55 
Gin Gin He Ser Trp Thr His 



Val Arg Arg Ser Xaa Leu Ser Trp 

10 15 
He Thr Gly Val Lys Cys His His 
25 30 
Ser Pro Ser Ser Cys Pro His Ser 
45 

Ala Ser Ser Asp Val Thr Ser Asn 
60 
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65 



70 



<210> 24 

<211> 427 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 231 

<223> 8-102-231 : polymorphic base C or T 
<220> 

<221> miscjoinding 

<222> 219. .243 

<223> 8-102-231. probe 

<220> 

<221> primer_bind 
<222> 212. .230 
<223> 8-102-231. mis 

<220> 

<221> primer_bind 
<222> 232. .250 

<223> 8-102-231 .mis complement 
<220> 

<221> primer__bind 
<222> 1. .20 
<223> 8-102. pu 

<220> 

<221> primer_bind 

<222> 408. .427 

<223> 8-102. rp complement 

<400> 24 



<210> 25 

<211> 310 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 105 

<223> 8-104-105 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 93. .117 

<223> 8-104-105. probe 



cccacagaaa tatctaattg aaatattaaa 
gatggtataa aaattcatta attataattt 
tttcatggtt taggtacatg tcactgtggc 
ggacttgtgt atggtaaatt ttgtgagtgt 
gaagaaatat gtggaggtat gtatattggc 
tttcacctat ttcattgttc ttttatctct 
tagtttaaac ctatgtttat ctggaaaaca 
taagagt 



ttgatttgga ttatctactc tcttgagaga 
tatgtgcatg taacaaacag tcttttttgt 
aggtgtaagt gtgataattc agatggaagt 
gacgatagag aatgcataga ygatgaaaca 
tctgtagaaa ttaataaaag tacctgtttt 
acataagttt ctgctctagg tgtgctaatg 
tttcacataa ggagagtgag actatggcat 



60 
120 
180 
240 
300 
360 
420 
427 
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<220> 

<221> primer_bind 
<222> 86. .104 
<223> 8-104-105. mis 



<220> 

<221> primer_bind 
<222> 106. .124 

<223> 8-104-105 .mis complement 



<220> 

<221> primer_bind 
<222> 1. .21 
<223> 8-104. pu 



<220> 

<221> primer_bind 

<222> 295. .310 

<223> 8-104. rp complement 



<400> 25 

gatggtgttt ttccttttta gaatattttt gtctgaatat atgtgttttt ttcaaacaca 60 

aagtaaacag aaaataataa gcgatatgtg aaatgctttc tttcrtggtt tttcttgact 120 

ggattcttat aaaattcttc ttcccaccta gggacttgtg tatgtggtga atgtacctgt 180 

cacgatgttg atccgactgg ggactgggga gatattcatg gggacacctg tgaatgtgat 240 

gagagggact gtagagctgt ctatgaccga tattctgatg acttctgttc aggwaagggc 300 

tcttccaatt 310 

<210> 26 

<211> 418 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 251 

<223> 8-94-252 : polymorphic base A or G 
<220> 

<221> misc_binding 
<222> 239. .263 
<223> 8-94-252. probe 

<220> 

<221> primer_bind 
<222> 232. .250 
<223> 8-94-252. mis 



<220> 

<221> primer_bind 
<222> 252. .270 

<223> 8-94-252. mis complement 



<220> 

<221> primer_bind 
<222> 1. .18 
<223> 8-94. pu 

<220> 

<221> prime r_bind 
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<222> 400. .418 

<223> 8-94. rp complement 

<400> 26 



gtggtctatg caggtatgga tggacattct cacacctaac caagcaatgt ttgcttttcc 
tcctcctttc ttcccatcac tagttgccat gtaccgagaa ccatctttgc atgatgttgg 
ggaaacggtc ccgaagcctg gggtgacgcc aagtaaaagc acaagtgcgt ctgcaataat 
gaatggaggc aaaccagtca acaagagtaa gacaacatag ccagatcctc acaggtgttg 
tgacttattc rtcctgagca cagttgagtg atttatcctc accagacatt cctgctccgt 
ggctgaagag cagcaggaag taagctaatg cttattcttt gctgtctccg aacttctctg 
ttgcaagtgg ataaatctca acctttgcac cccccacaac aagaagacac ctggataa 



<210> 27 

<211> 419 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 43 

<223> 8-95-43 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 31. . 55 

<223> 8-95-43. probe 

<220> 

<221> prime r_bind 
<222> 24. .42 
<223> 8-95-43. mis 



60 
120 
180 
240 
300 
360 
418 



<220> 

<221> primer_bind 
<222> 44 . . 62 

<223> 8-95-43. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 8-95. pu 

<220> 

<221> primer_bind 

<222> 400. .419 

<223> 8-95. rp complement 



<400> 27 



gactggaact attctaggag atttgggttt aatagtgtgc tcrcctcctt gaaagcctta 60 
gtgattccgg gactctcctc ataagtgtgc actttcttta tactttccag gtaaatgtqa 120 

180 



atgtggcaaa tgcacctgct atcctccagg agatcgccgg gtgtatggca agacttgtga 

^ c ? ct gtg aagacctcga tggtgtggtc tgtggaggta gtaacctttc 240 

300 
360 



tcatagctgt atgctacctg catgcaactt gctttacctg cctttgttat tctactgact 
cttgcttttt caattgctgt taaaaataag ataaacagaa agcaatatac cattgaacct 
aaaagcaata tacaattagg tttgtaatga cttttatttc cctgactcct ttatttagc 419 

<210> 28 
<211> 350 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> allele 

<222> 181 

<223> 8-97-181 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 169. .193 

<223> 8-97-181. probe 

<220> 

<221> primer_bind 

<222> 162. . 180 

<223> 8-97-181. mis 

<220> 

<221> primer_bind 
<222> 182. .200 

<223> 8-97-181. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 8-97. pu 

<220> 

<221> primer_bind 

<222> 331. .350 

<223> 8-97. rp complement 

<220> 

<221> allele 
<222> 98 

<223> 8-97-98 : polymorphic base C or T 
<220> 

<221> misc_binding 
<222> 86. . 110 
<223> 8-97-98. probe 

<220> 

<221> primer_bind 
<222> 79. .97 
<223> 8-97-98. mis 

<220> 

<221> prime r_bind 
<222> 99. . 117 

<223> 8-97-98. mis complement 
<400> 28 

ctgagaaacc agccattcaa tacactttta cctttggatg atgggttatt agaagcctgt 60 
tgtcagtgtg tcaacgatat tctattcgag atggaaayct ttagttataa ggtgatggtg * 120 
agtaatagcc tttgtaattt caggttcttg tcattgtggg aagtgcattt gttctgctga 180 
rgagtggtat atttctgggg agttctgtga ctgtgatgac agagactgcg acaaacatga 240 
tggtctcatt tgtacaggtg cagtattaac cttttctaat tgctctatgc cacagtttgt 300 
tatagagcca agagacactc gtcatgcaat gctttagtgg gctggaccaa 350 

<210> 29 
<211> 434 
<212> DNA 
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<213> Homo sapiens 



<220> 

<221> allele 
<222> 68 

<223> 8-98-68 : polymorphic base A or G 



<220> 

<221> misc_binding 

<222> 56. .80 

<223> 8-98-68. probe 

<220> 

<221> primer_bind 

<222> 49. . 67 

<223> 8-98-68. mis 



<220> 

<221> primer_bind 
<222> 69. .87 

<223> 8-98-68. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 8-98. pu 

<220> 

<221> primer_bind 

<222> 415, .434 

<223> 8-98. rp complement 

<220> 

<221> misc_feature 

<222> 414,416 

<223> n=a, g, c or t 



<400> 29 

atttcattgt 

taatcatrat 

tgaatgctgg 

ttaacaatta 

atgcgaagga 

ttctatttct 

agatgcaaat 

gagagcaatt 



ggacacttgt 
acctatatgt 
gatggatgga 
catgagagag 
aaccatgtat 
gaattacgaa 
aacattagaa 
tttc 



gatttataat 
attttattgc 
atggaaatgc 
gtctggattc 
attcaccact 
tgaaatccga 
aaaaaagatt 



agcatgttta 
agggaatgga 
atgtgaaatc 
ttattttttc 
aggacaggtt 
gtacctatta 
cttccataat 



aatttggcac 
atatgtagct 
tggcttggct 
tgggccatta 
aaaaagacca 
gaaatgagtt 
taacataagt 



attttgatca 
gtggaaactg 
cagaatatcc 
gaacatataa 
ttgtatgttt 
atgcaaattt 
ggtncntaac 



60 
120 
180 
240 
300 
360 
420 
434 



<210> 30 
<211> 477 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 108 

<223> 99-14021-108 : polymorphic base A or G 



<220> 

<221> misc_binding 

<222>. 96. ■ 120 

<223> 99-14021-108. probe 
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<220> 

<221> primer Jaind 

<222> 89. .107 

<223> 99-14021-108. mis 

<220> 

<221> primer_bind 
<222> 109. .127 

<223> 99-14021-108. mis complement 
<220> 

<221> primer_bind 
<222> 1 . .18 
<223> 99-14021. pu 

<220> 

<221> primer_bind 
<222> 465. .477 

<223> 99-14021. rp complement 
<400> 30 

tttgttgtta atcgcccctt ttctgcaaca cttgtgggtt agggaaaata attctaaagc 60 
aagagcaaag acagagttgg gagatcacca gtgaggttca attttccrtc acattcactc 120 
tgctccacac ctcagataat catgtgctta actgcgaaac ttgcttgaca attacagaac 180 
actttctcac ccattactac cttgatcctc acaattctgt ggggtagtag gagcagatgc 240 
tgaaattgcc atacgcaaat cagtgaactg aagcttagag acctccagca ggggcagagg 300 
gtcagcggaa actatcccag ggttcagcca acaagaaagt atattggaat cagagtatta 360 
aaataagaat aataaaacca actaaaattt accgtgcttt ttatttccac tcagtgccaa 420 
caattcttaa cagtgtcagt gatggatccc tgtgccccag ggrmcagact tcttact 477 

<210> 31 

<211> 473 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 

<222> 415 

<223> 99-14364-415 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 403. .427 

<223> 99-14364-415. probe 

<220> 

<221> primer_bind 

<222> 396. .414 

<223> 99-14364-415. mis 



<220> 

<221> primer_bind 
<222> 416. .434 

<223> 99-14364-415. mis complement 
<220> 

<221> primer_bind 
<222> 64. .82 
<223> 99-14364. pu 
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<220> 

<221> primer_bind 
<222> 434 . .454 

<223> 99-14364. rp complement 



<400> 31 

catgcccatc ctaaatattt acatgccctg acagataagc ccttagcctg cccaatgggt 60 

aacttcaaca gcatctctag tacaactgtt gctcattctg tgttttaatt caacccagct 120 

ataagatacg aaatgataga attgctctag attctctatt ggttaaataa ggagatattt 180 

gtgctattgc caataataca tgctgtacct ggataaaccc ctttgggcaa gttgtgatgc 240 

aaatactcaa gaaaataggc cacatagtta caacaggact tacctaattc cccatggtca 300 

tttggctgat tcagtcagtt gctttcaagc ctaggttctt ggctcaatat tattacataa 360 

actagaattt tcctattact attaatttta ctttgtattt ttctttataa acttygtact 420 

tattgcttgt caaatttcag cagaagtaca actcctgaga gaataatgct ggc 473 

<210> 32 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 99 

<223> 99-15056-99 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 87. .111 

<223> 99-15056-99. probe 

<220> 

<221> primer_bind 

<222> 80. .98 

<223> 99-15056-99. mis 



<220> 

<221> primer_bind 
<222> 100. . 118 

<223> 99-15056-99. mis complement 
<220> 

<221> primer_bind 
<222> 1. . 18 
<223> 99-15056. pu 

<220> 

<221> primerjoind 
<222> 482. .502 

<223> 99-15056. rp complement 



<400> 32 

caggaaactc acaagaagsc agatttcctt cgagcacctc ctgaataaag aggcaaaggc 60 

cttcttaact cttacaattt acaagtggct atgagtgcyt ttatagttcc cataataatt 120 

tctccacgta gacttcctaa ataataattt ctcctgtttt atattctctg tgcttatgtt 180 

tatatcaaac aagttaccac ttaatcaaat gccgatttgc attgctcact atgtaacttt 240 

aattttcttt gcctcttatt tttggatctt aattctaaaa ctagatgatc ataaattcat 300 

ttaggaataa gcttgtgatc tagccttctt ttgaacccct ttgtgctcct cacaatattt 360 

gtttcgatga aacagtgagc aacatttgat ctatgattgt taatagaaaa acaccaatgt 420 

ctcaagttat tgtaaacata ggcataattg acctttggtt ctataaatat gtttggtgtt 480 

ccccaaaata cgtctccctt tt 502 
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<210> 33 

<211> 494 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 412 

<223> 99-15229-412 : polymorphic base A or G 



<220> 

<221> misc_binding 

<222> 400. .424 

<223> 99-15229-412. probe 



<220> 

<221> primer_bind 

<222> 393. .411 

<223> 99-15229-412. mis 



<220> 

<221> primer_bind 
<222> 413. .431 

<223> 99-15229-412 .mis complement 



<220> 

<221> primerjbind 
<222> 1. .20 
<223> 99-15229. pu 



<220> 

<221> primer_bind 
<222> 476. .494 

<223> 99-15229. rp complement 



<400> 33 

ctgtcattga gaaatgctac caataatact tagagaattt gatacaactc agtctgaaaa 

agctaagatt agcagaacag agctgtctcc aaatatttga agaactattt tatttaaggg 

attggaccca tttttgtatg tagttccaga ggagcagatg gtgaccactg tccaggcaga 

tgtgtctcaa tgtaaggaca acatctgtaa tattaataat tagaatgtat cctgtaattt 

tctctctacc cttggaaacc agtcgagatc cagagtcttt cactgggagg cttaaagcct 

agagcagcct tggtgctaga ggcggacagg gataatgaac taatcttgaa ccaattcatc 360 

catagcaatc tcaatgcttt cgttagctct tataggtatt taatacggcc araggaatga 420 

aggtagtctt gctggtttag aagccctgcc taccacaacc cctacaccac cccatcccct 480 
gcatagtctg atgt 494 



60 
120 
180 
240 
300 



<210> 34 

<211> 485 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 291 

<223> 99-15232-291 : polymorphic base G or T 



<220> 

<221> misc__binding 

<222> 279. .303 

<223> 99-15232-291. probe 
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<220>. 

<221> primer_bind 

<222> 272 . .290 

<223> 99-15232-291. mis 



<220> 

<221> primer_bind 
<222> 292. .310 

<223> 99-15232-291. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-15232. pu 

<220> 

<221> primer_bind 
<222> 467. .485 

<223> 99-15232. rp complement 
<400> 34 



caatagaaca ggctgctcct ttataattat taatcatagt gtatattaat tcatcatcac 
atacgtggct agaaaaaaat ttagaacaaa aagatatgtg atatgtaaag gcctacgata 
attcagactt ctttgaggag agcttttatt ttattgttat tcttatttta tctcttgtca 
atataaattg agagaataaa cagacaaaca ttacaaatta gtgattaatt gcatttaaaq 
cctagttaag actatttaag actattatgc ataatacagg aaaactacct ktattattta 
tagtgggtgc cttctgaagg atctgaagga gaatcagttc tatgcctctc tcctcattcc 360 
caggaggtgc ctggcattcc ttggcttgta gacgcatcac cctaatctct acctctgcct 420 
tcacatggtg tcccctgtgt gtgtgttttt gccccatgtg tctcctcttt ttatatggat 480 
gccaq ^ 



60 
120 
180 
240 
300 
360 



gccag 

<210> 35 
<211> 477 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 382 

<223> 99-15253-382 : polymorphic base C or T 
<220> 

<221> miscjbinding 

<222> 370. .394 

<223> 99-15253-382. probe 

<220> 

<221> primerjbind 

<222> 363. .381 

<223> 99-15253-382. mis 



<220> 

<221> primer_bind 
<222> 383. .401 

<223> 99-15253-382. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15253. pu 
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<220> 

<221> primer_bind 
<222> 459. .477 

<223> 99-15253. rp complement 



<400> 35 

aaaatcaatt 

ctattttatg 

ttttttttac 

tccataatgt 

tcatcttttc 

aaataaataa 

tgcacagcac 

attacacagg 



ccccaacact 
gtttcatgtg 
aatattttat 
gagtggtaaa 
cttaattttt 
attactttgt 
agcctgtaat 
tcctcattgt 



cattttgtac 
ttacagatga 
gattacaaag 
tgcagagcac 
ttccaaccta 
aaataatctt 
tyctatgaac 
gtaactaaac 



gctaattttg 
aaaaaaaact 
ttagaagact 
tggcctaaag 
tgatctgttg 
gactggtttc 
ctagggaaat 
attgcataga 



taagatcctg 
agaattcaaa 
aagaataaaa 
aaaatatttc 
aatgagcatt 
tgttgaccac 
gtatttaagt 
atatgccagt 



aaaagtttca 
ttttctgagt 
tggcctaatt 
aaaaaattag 
ttgcatatat 
agtaacccac 
ttattttttg 
gatgatg 



60 
120 
180 
240 
300 
360 
420 
477 



<210> 36 
<211> 456 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 392 

<223> 99-15256-392 : polymorphic base C or T 



<220> 

<221> misc_binding 

<222> 380. .404 

<223> 99-15256-392. probe 

<220> 

<221> primerjDind 

<222> 373. .391 

<223> 99-15256-392. mis 



<220> 

<221> primer_bind 
<222> 393. .411 

<223> 99-15256-392. mis complement 



<220> 

<221> primer_bind 
<222> 1. . 18 
<223> 99-15256. pu 



<220> 

<221> primer__bind 
<222> 439. .456 

<223> 99-15256. rp complement 



<400> 36 

cctctctatg atgcttccta ttaagcaatt ggggaaatgt aataaacaag ggttggtgag 60 

catcttcctt agtgagatgt ttttggaaga attggataat tgagtgaata atagtgagaa 120 

actcctgtgt ctgatgttgc tccatgttgg aatgctttta tgttctcaga gaatgagtca 180 

ctgagagcca attgtgatga tacacaatgg ttttacccag gttggatatg gtcctctgta 240 

ctggtaccct ttaagtcagt ggcactaatc agtcagtcat tgtcatgctt tgtgttggtc 300 

catcatatgg tatgccctct tagagaacat cctgattagt ccttagacat cttttcaatt 360 

tgaacactgg ggctcctcat tcgggtaaaa aytatggaca gtcagtgaaa ctgttgcaat 420 

ggcccctcat agcagattgg atctcaatgc actttg 456 



<210> 37 
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<211> 541 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 337 

<223> 99-15258-337 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 325. .349 

<223> 99-15258-337. probe 

<220> 

<221> primer_bind 

<222> 318. .336 

<223> 99-15258-337. mis 

<220> 

<221> primer_bind 

<222> 338. .356 

<223> 99-15258-337. mis complement 
<220> 

<221> primer_bind 

<222> 1. .19 

<223> 99-15258. pu 

<220> 

<221> primer_bind 
<222> 521. .541 

<223> 99-15258. rp complement 
<400> 37 

cagtaaagaa ggacttagga 
aggatttatt aaacacagct 
aaaatttaca aactgatgtg 
aaaaaatata aatgggggca 
tactagggaa aatttagaag 
tctacaggat gaaggaagaa 
gaaatcaata aggcatgtat 
agggatcagt taacacaaac 
aaaattaagc acttcccagt 
t 

<210> 38 
<211> 500 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 200 

<223> 99-15261-202 : polymorphic base A or G 
<220> 

<221> miscjbinding 

<222> 188. .212 

<223> 99-15261-202. probe 



aatatttttt gaatcatgag ctgctaatag atcagtaaat 60 

gtattcagga ctctgacctg tgtgggcctt agggaaatac 120 

attttcttac cagtaagaaa tcttatgaaa gattaaatta 180 

actcaatcac ataaaagtaa tatattccat gtgactgaaa 240 

tagaatttaa gtgagaattt caagcagggt tctgaaagat 300 

ttgcccatat gccaatkgcc tatgtgaaat cacaaaaaca 360 

ggaagtacag agtgattgcc tagttagaaa aagagcgtga 420 

aagaagagtt taatgcttat ctgcaataag ttggtagagc 480 

atactgatag acgttaatty ccactctacc atgtgaaatt 540 

541 
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<220> 

<221> primerjDind 

<222> 181. .199 

<223> 99-15261-202. mis 

<220> 

<221> primer_bind 
<222> 201. .219 

<223> 99-15261-202. mis complement 
<220> 

<221> primer_bind 
<222> 1 . . 19 
<223> 99-15261. pu 

<220> 

<221> primer_bind 
<222> 481. .501 

<223> 99-15261. rp complement 
<400> 38 

cttctaatcc tttgtttcca cttattttat ttcattcctc attttatccc ttttttctaa 60 
attccatttt attatactta aggtgctttt aatatggtta tcatactcct gatagtgtta 120 
tttctttctt agtcttctta tataagcgct atacgttcac attccatctc ctttggttat 180 
ctttccattt cttcaccgar cctctttgct ctcttttttt atagctggtt cactcaaaat 240 
gtcttacttt gccatttttg aaatttattt tcattctttt atgtactgaa taaaatttaa 300 
aaatacttta tcatggtggg aggtacccgt gatgtccaaa taagtgttta tattaattgt 
tggggttttt ttgtttgtgt gttttttgaa aggttaagaa aatctcattc agaaagtaag 
ttgtttaaaa attctggacc aaatttacca cacatcaagc agatacttac caagttgttt 
gtagacatta gcagtattta 



<210> 39 

<211> 541 

<212> DNA 

<213> Homo sapiens 



360 
420 
480 
500 



<220> 

<221> allele 
<222> 432 

<223> 99-15280-432 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 420. .444 

<223> 99-15280-432. probe 

<220> 

<221> primer_bind 

<222> 413. .431 

<223> 99-15280-432. mis 



<220> 

<221> primer_bind 
<222> 433. .451 

<223> 99-15280-432. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-15280. pu 
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<220> 

<221> primer_bind 
<222> 521. .541 

<223> 99-15280. rp complement 
<400> 39 

atgtccatcc atcttgccca 
tacttgcctc acctattgct 
catcttccca cctcagagca 
ttgtcccttt cttaatgtct 
cacttacatg tcatattttc 
attgctttta aaatttctct 
agtccattat cttatttctc 
agccatccag tyagtggtac 
agaatgtttg caatcatctg 
t 

<210> 40 
<211> 489 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 150 

<223> 99-15355-150 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 138. .162 

<223> 99-15355-150. probe 

<220> 

<221> primer_bind 

<222> 131. .149 

<223> 99-15355-150. mis 

<220> 

<221> primer_bind 
<222> 151. .169 

<223> 99-15355-150. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-15355. pu 

<220> 

<221> primer_bind 
<222> 471. .489 

<223> 99-15355. rp complement 
<400> 40 

taacttctcc gtctctcctt cttagcccat atgtcaataa tgactgaaag tattcatttc 60 
catctttaaa ctgcctattc cagccacctc ccacctccat ctctttcctt ctaagttttc 120 
ttcatcttct actttgggca aaaggaaaty gatgtgtcag acaggcctag ttttgaattc 180 
tggatctgct agcacttctc tgtgtgtcct tggttatatg atatagtctt aaaccttaat 240 
gttcttgcct gtaaaatggg gataataaaa acctcttaac agtggttgtt tcatgcagct 300 
ttcattacaa acttcctcat tcaaaatctt caatgatttc catttttcac aaaatgaaat 360 
tcaaaatttc tgtagattat tgagacaagt cccctactct tcacctaaat ttatctttta 420 
tttattctct catcattatc aacaactact aggctttgtt gccttgactc cagaggcaaa 480 



gagagagttt ctacaacact tcctctgcaa gccctttccc 60 

ttcctctgtt acgttgtatt cccctcactg tttcttccaa 120 

tggacacttg ctgctctttc tgtgtcatga tgctgctcac 180 

cctccctgag ccaatcttct ccacccccac aacttacgca 240 

cttcatagcc tttaacacca tttgaaatga tatatatttg 300 

gtccccccac taaatataaa cttcaggatg gcaagaatgt 360 

cagcctccat acttttaaga aaataaattt tggttgtata 420 

ttggttatag cacccctagc aaaagaatac aaaaaaaggg 480 

tttgaggcta ggaattccca gagaggraam caaggagtaa 540 

541 
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aatcttatc 

<210> 41 
<211> 450 
<212> DNA 

<213> Homo sapiens 



489 



<220> 

<221> allele 
<222> 298 

<223> 99-15663-298 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 286. .310 

<223> 99-15663-298. probe 

<220> 

<221> primer_bind 

<222> 279. .297 

<223> 99-15663-298. mis 



<220> 

<221> primer_bind 
<222> 299. .317 

<223> 99-15663-298. mis complement 
<220> 

<221> primer_bind 
<222> 1. . 18 
<223> 99-15663. pu 

<220> 

<221> primer_bind 
<222> 430. . 450 

<223> 99-15663. rp complement 
<400> 41 

tcccaccttc ttctaaacgt gttgcttcaa tacgttgata ggtgaggaca cttaaaaatt 60 
agactttata gaaataggtt tttttttgtt tacatatata gttcttttgg tatcatatat 120 
ttagcctctt tctaaaattt attttttgat actgaaggga gaaataggga gttattaatc 180 
aacaggcatt aattttagtc aagcaaaata aataagctgt agcgatctgc tctgtaacat 240 
tgtacctaca gccaacaatt atatgttgtc cacttaaaaa tgtgttagat ctcatagyaa 300 
ctcttcttac cacaataaag taaaaattct gaaacaataa gtgaatacct aaataataca 360 
aacaaataca atattgtagt tttgggcact taataaatga cagcctcatt tctcaattag 420 
agatcatcac aagttagaca gatgacgatg 4 5 0 

<210> 42 
<211> 502 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 185 

<223> 99-15664-185 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 173. .197 

<223> 99-15664-185. probe 



157 



WO 01/40493 



PCT/IBOO/01507 



<220> 

<221> primer_bind 

<222> 166. .184 

<223> 99-15664-185. mis 

<220> 

<221> primerjDind 
<222> 186. .204 

<223> 99-15664-185. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15664. pu 

<220> 

<221> primer_bind 

<222> 483. .502 

<223> 99-15664. rp complement 

<220> 

<221> misc_feature 
<222> 54 

<223> n-a, g, c or t 
<400> 42 

gtttaccatt agcactgtca tatttgtgtg acttgtcatt ctctacagcg gagnacgggc 60 
tggcacgggg cctgatgctg acttgcacaa gggaagcctc ctgtctctga cttccccagg 120 
ataattcctg gggaaagtgt gctccctagt gttaagagcg gtttaatggc tggagggttt 180 
cagcJcggctg accaggcaga gaaggagggt gaatcacctc tcagcactct ccacttagac 240 
tttgtgtggt cgtcgggtgg tcaaaccttc taactagttg tattgcagat ttggcattcc 300 
agtgcaaaca aaagacagaa acacaatgtt cacatgcttt ccagagatca cctggatatc 360 
agatcatttg attttcaagt aagtcgaaac cttggtggaa atcattaact atcctgttta 420 
tgaccaaaaa ataaaatccc aaatttcttc tcttcatttc ttacctgctt taaaattgta 480 
tccaaagcgt gaaagtaaaa ga 502 

<210> 43 

<211> 476 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 

<222> 398 

<223> 99-15665-398 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 386. .410 

<223> 99-15665-398. probe 

<220> 

<221> primer_bind 

<222> 379. .397 

<223> 99-15665-398. mis 

<220> 

<221> primer_bind 
<222> 399. .417 

<223> 99-15665-398. mis complement 
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<220> 

<221> primer_bind 

<222> 1. .20 

<223> 99-15665. pu 

<220> 

<221> primer_bind 
<222> 458. .476 
<223> 99-15665. rp complement 

<400> 43 

cgtaaatgtg aaaagcatag cctcttcttg gaatgttaag tataaatatc tgaaatactg 

ggcttgatat gtcaacagga gattgatgga taaaaataga attttatata aaaaacaact 

ggacatatta gattgttaac ttggaagaaa gaccatattc aaagaagaaa acatagtgac 

taatttcaaa catttaaagt cttccctgtg gaaacaaagg aatatctttg ttctaacact 

tcaaagaaca gggttaaaaa atagactcac cacagagtaa atgcacaatt gacaatcgtg 

aatgaattaa aaaccaaaca aaatattttg tcagctttct atctatgaaa ctaagaaaca 

ggcttcctac taaggtaatg aatgtaattc acagagarca ttcacgtata agtttcattc 

atgtttcaaa tttcattgat ttgatcaatg ggttattcta ataccctccc ttattt 

<210> 44 
<211> 459 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 139 

<223> 99-15668-139 : polymorphic base C or T 
<220> 

<221> misc_binding 
<222> 127. .151 
<223> 99-15668-139. probe 

<220> 

<221> primer_bind 

<222> 120. . 138 

<223> 99-15668-139. mis 

<220> 

<221> primer_bind 
<222> 140. . 158 

<223> 99-15668-139. mis complement 
<220> 

<221> primerjbind 
<222> 1. . 18 
<223> 99-15668. pu 

<220> 

<221> primer_bind 
<222> 439. .459 
<223> 99-15668. rp complement 

<400> 44 

agaggtgaga gaaaggtgaa ttgtgcaaat atggacatcg aagttattca gacacatatg 60 

gtttccatgt cgtgagacct taaagcactc tcagaaactc caagaaagct gttgattggt 120 

cagatttctc acttttgtyc cctcaaaaca agattgtcct gaacttatag gaaatatact 180 

caaaagtagt atgatttaag gaagtcttcc aaatacctgc ctcgtttaca gataatcaat 240 
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ggaattccaa aagattttat gtgttttaag ccctaaaaat gcaaatatcc aggaaagggc 300 

tagcactaaa cagagctggc atgtatgcct cagtgcaggt gtatatctca gtgtctcctg 360 

aagccagggc cttagaggat tgtggaaata ggtacactga cagcagggac tggaagccac 420 

agtctggaga gcaaagagca gaatccatac ttacacctc 459 

<210> 45 

<211> 547 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 

<222> 166 

<223> 99-15672-166 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 154. .178 

<223> 99-15672-166. probe 

<220> 

<221> primer_bind 

<222> 147. .165 

<223> 99-15672-166. mis 

<220> 

<221> primer_bind 
<222> 167. .185 

<223> 99-15672-166. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-15672. pu 

<220> 

<221> primer_bind 
<222> 533. .547 

<223> 99-15672. rp complement 
<400> 45 

ccaataccat aactcctcta taggacatgg aagagtatta tatatgacaa atgattgcta 60 

tgattattat tatcagtgtt attattatcc taatcctaag taatccaata aaagaaaaat 120 

acatctgtgc ctgtgcgtat gtgcacgtgt gtgcagtcaa atacaygttg agtaaaggta 180 

aagtctagct gtatttaatc aacctacctg aatcctcagg aaaaaattct aaacctagtt 240 

taaaacatgt aaactctaag ctctctcctt atagtcagtt agtagcagca catcttaaaa 300 

tctggtgtga atattctctt agttctacat gagtctaact aaacagagga ttattcttag 360 

gtgtttgaaa gagacatatg tgacactgct gttttgagaa caatttaagt gttgtcttgt 420 

catgtacaga agttctcata ttactttaca taaatggttg cataattgtt ttatagtaaa 480 

taatagactg tcaatatttc taggataact ccaaaacaaa atttcctaga mmacattttg 540 

aaaaggg 547 

<210> 46 
<211> 500 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 318 

<223> 99-15682-318 : polymorphic base A or T 
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<220> 

<221> misc_binding 

<222> 306.. 330 

<223> 99-15682-318. probe 

<220> 

<221> primer_bind 

<222> 299. .317 

<223> 99-15682-318. mis 



<220> 

<221> primer_bind 
<222> 319. .337 

<223> 99-15682-318. mis complement 



<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15682. pu 



<220> 

<221> primer_bind 
<222> 482. .500 

<223> 99-15682. rp complement 



<400> 46 

gtttgatgag 

agagaggcag 

gaatctttcc 

tatcacaaag 

catttcacta 

tacggcatga 

aactgttgac 

agtacaaaat 

tgtatgtatg 



caacaagtgg 
tagacacaat 
attaaatatt 
atcagtaact 
taatctctaa 
aattgagwtc 
atttacaatc 
taattatgag 
ccaataaagg 



ccacacaagt 
aaaaaattct 
tgtttaaagt 
tatttcctct 
gatacagttc 
taaaactctt 
attgaagaac 
ataatatatg 



attggagaaa 
atataacaaa 
tacagactac 
gtatagtaga 
aagggcacaa 
gtatctttaa 
tatttgaact 
gagcctgkcg 



aaaaaaatta 
tgctatatct 
ttagatgtta 
tgcctcttct 
ttttataatt 
caaccatttt 
ctgtagagta 
gctttatttt 



gaatgtgggg 
ttaagaaatc 
aagtatcact 
gtagaattaa 
caaagatatt 
gtaattaaat 
tagatattga 
ttaataagtw 
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<210> 47 

<211> 461 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 217 

<223> 99-16081-217 : polymorphic base C or T 



<220> 

<221> miscjbinding 

<222> 205. .229 

<223> 99-16081-217. probe 

<220> 

<221> primer_bind 

<222> 198. .216 

<223> 99-16081-217. mis 



<220> 

<221> primer_bind 
<222> 218. .236 

<223> 99-16081-217. mis complement 
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<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-16081. pu 

<220> 

<221> primerjoind 
<222> 443. .462 

<223> 99-16081. rp complement 
<400> 47 

caaaggacag agaaccagca ggcaggccct gggtcaagcc ccgtgatgtc ccaggaggag 60 

gggaggaggg cggagtcgga agggagcgag ggagccaggg aggcagggcc ctctgagggc 120 

agggagttct ccatgaagtt ccagaagcca atgaccagat cggcgcatgc aaacagctta 180 

aaagaaaaca ttaggcactg gtggcttcgc tctaaaygaa cagttcttaa aactgctttc 240 

agttctagcc ataataaagg catttgaatt atgttgcttt tgatgaaact ataaagatat 300 

tggcttttat acagtatcat gccagaaaca aatttataac attgttatgg tataacagca 360 

ggaatacttg gaatacacat ttttcagctg agctctaaga attacttttc ccttgcccca 420 

ggttggcaat taagaactat aacttactcc accccactcc a 4 61 

<210> 48 

<211> 532 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 218 

<223> 99-16082-218 : polymorphic base A or G 
<220> 

<221> miscjbinding 

<222> 206. .230 

<223> 99-16082-218. probe 

<220> 

<221> primer_bind 

<222> 199. .217 

<223> 99-16082-218. mis 

<220> 

<221> primer_bind 

<222> 219. .237 

<223> 99-16082-218. mis complement 
<220> 

<221> primer_bind 

<222> 1. .18 

<223> 99-16082. pu 

<220> 

<221> primer_bind 
<222> 512. .532 

<223> 99-1 6082. rp complement 
<400> 48 

caattcagca agagccactt tggttaacaa accttctggg aatgaaggtg ctttacaaag 60 
aagcaggctg tgataaaaac aaacaagaca acattcataa actgtgggaa gcattgttgt 120 
cattgggtca cacctcctgc accaggcagg tggttgtctg ctgcgttcct ttacagaaca 180 
gatgctgagc ctgcctggag gtggcaattt accaatcrat ggtcttaagc aattccaaag 240 
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gctggggtaa ggggtagaaa agagaagctc aaaaatgtta gggatggaga aggggggagt 300 

gattcaggaa gaggagaaac ctgttcattt cctcccaaat tacaaccact g?c2I?c£ 360 

tggctagtcc tgtgcaaact tccagtcccc agctgatatc cctgcagaca agtagagaga 420 

ggctagaaca aaagctaaag tgtagatgtc ccatacacta atcagtgttg ttttttaatc 480 

agaggttgaa attcatactc tgatgcaaac tgtattagtc agwtgttgct gc 532 

<210> 49 

<211> 451 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 81 

<223> 99-20933-81 : polymorphic base A or C 
<220> 

<221> misc_binding 

<222> 69. .93 

<223> 99-20933-81. probe 

<220> 

<221> primer_bind 

<222> 62. .80 

<223> 99-20933-81. mis 

<220> 

<221> primer_bind 
<222> 82. .100 

<223> 99-20933-81. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-20933. pu 

<220> 

<221> primer_bind 
<222> 432. .451 

<223> 99-20933. rp complement 
<400> 49 

ttccattcac atgagaaact atgtgtttac tatttctgtt atgttttgta tgattttgtg 60 
tgcttgtata ttttcagtaa mtgctggggt agcaagtgtc agaatcactc caaaaaaact 120 
gagatatatt tgccaagtca acagtctcca gactattgtt tcagcaagac aaggggcaag 180 
gctctttctg ggcctgagta atcagtaaat atataaaaag aataagaata atagtgaagt 240 
tactgctaga cgtatttcct tacatctgaa cattttcttt catttgcaac acttaacctg 300 
gaatcgggtg cttccagaaa atcctgtaaa aatcatgtag cttccaagat ttaaagctgg 360 
cctagawaac atgaacatga tcccatgtct tatctttact catacatcag gaattctcta 420 
aatttctggg agaacatttt gtagaagaca c 451 

<210> 50 

<211> 511 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 71 

<223> 99-20977-72 : polymorphic base A or C 
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<220> 

<221> miscjDinding 

<222> 59. .83 

<223> 99-20977-72. probe 

<220> 

<221> primer Jbind 

<222> 52. .70 

<223> 99-20977-72. mis 

<220> 

<221> primer_bind 

<222> 72. . 90 

<223> 99-20977-72. mis complement 
<220> 

<221> primer_bind 

<222> 1. . 18 

<223> 99-20977. pu 

<220> 

<221> primerjoind 
<222> 491. .511 

<223> 99-20977. rp complement 
<400> 50 

gtacatggta ggtggtagat ggtgaaataa tttttatatg taaggagttt caaagatgat 60 
ctaaaacatg maggcctaaa acaatagatg aaacactatt tcaaaatgta ttctctcttt 120 
gttttcactc tccatgttac aaaaatggca acaaattacg taaatgccaa ggaagccaat 180 
atatcatatg gtaaatgaca aatcaactgt aacattgaaa caacaattaa tagctattat 240 
agctattaaa atttgaagca aaattgtgca tctttaaaaa ttatacacac atgtgagagg 300 
actcatgact gaaatctcaa tataccatct atccaagttg agttattttc cttttttttt 360 
ttttttaaga taattattct tcttcctact tctgggaggg acagtgtatg ggaggtacct 420 
gtcaggcact gtgagaaccc tgggctctga acagatatgg ggtatttctc tcagtatcct 480 
caatgcagct ccaataacta atcccatttc t 511 

<210> 51 

<211> 466 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 89 

<223> 99-20978-89 : polymorphic base G or C 
<220> 

<221> misc_binding 

<222> 77. . 101 

<223> 99-20978-89. probe 

<220> 

<221> primerjbind 

<222> 70. .88 

<223> 99-20978-89. mis 

<220> 

<221> primer_bind 
<222> 90. .108 

<223> 99-20978-89. mis complement 
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<220> 

<221> primerjbind 
<222> 1. .19 
<223> 99-20978. pu 

<220> 

<221> primerjbind 
<222> 446. .466 

<223> 99-20978. rp complement 
<400> 51 

acttaactgc tacctaatgc agcctgtaca agcggaataa ttgaactaaa atgtcttcaa 60 
cactaaaaat aaaatgcaaa tcaaatctst ttaaaaaacc aaaaccaact tgtattttga 120 
agattaactg ccactgtgtc taaacccaaa tcctgctagc cagagctgaa tttgtgatta 180 
caaagcagta acgcaaaggt tagaaaggat caaagatctt gaaaaacctc ttctcagatc 240 
catctcactt ttccattggc cacatttcca atacatctct gccccattaa aaccttacca 300 
aaattgctag ttcagtacta taagtttaaa acaaatgggt tgctcaattc cttttgatgc 360 
atcttccgca tctctcgctc tttttctttt tttctatttt cactgccagc ccctcattac 420 
ctctcttagg tggtcttccc acctgcagtt ttccttttca aatcct 466 

<210> 52 
<211> 449 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 300 

<223> 99-20981-300 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 288. .312 

<223> 99-20981-300. probe 

<220> 

<221> primer_bind 

<222> 281. .299 

<223> 99-20981-300. mis 

<220> 

<221> primer_bind 
<222> 301. . 319 

<223> 99-20981-300. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-20981. pu 

<220> 

<221> primer_bind 
<222> 429. .449 

<223> 99-20981. rp complement 
<400> 52 

tttgccatcc ctttgctgat ggtttgattt ggaatgtgtc atttcgtttg tgaaatgaga 60 
catagtcaag gtacccatgt gctcaagtgc ttagattaca acaatgtttt gatgcaaatg 120 
tgcaaagtac tcagagatca acactatggt ctaataacag gccatagcaa aatagagcaa 180 
atgaatcctg tgaaatcaga gagaatgcat caaataaaga tgtactacct aaagtgcaaa 240 
atctactagg agcactttga catttcagtt caagttctac agtgtttcac attttttcar 300 
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tctctattgt agtctcactt ggaggtgatt gcttgatcac tagaggaata tgatattatt 360 
cgtttcagaa atgaagcagc tatggtgttc tcttgtgaat agcactagat atgaattagg 420 
attatgatct tgtttgtgta gaatgtttc 449 

<210> 53 
<211> 449 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 48 

<223> 99-20983-48 : polymorphic base A or G 



<220> 

<221> miscjDinding 

<222> 36. .60 

<223> 99-20983-48. probe 

<220> 

<221> primer_bind 

<222> 29. . 47 

<223> 99-20983-48. mis 



<220> 

<221> primer_bind 
<222> 49. .67 

<223> 99-20983-48. mis complement 



<220> 

<221> primer_bind 
<222> 1. . 19 
<223> 99-20983. pu 



<220> 

<221> primer_bind 
<222> 431 . .449 

<223> 99-20983. rp complement 



<400> 53 

catgacacat caataagaga agcctgctgg cacagaggag ggcaggtraa accacctttg 60 

tagaagaggc tcatttgaaa atgctcctga aagatatgat atcagaacac tatcttatgt 120 

agagtggata agactttgat gatcttgaaa atgcttactg aggaatacat agcatttatc 180 

tttctttcct tgattaaaga actttcatat atacatattt atcctgtatg gataggtagg 240 

agaggtgatg ccagcccaca gagatgacca gtccctattc cagaagttct gagttgttgc 300 

ttaagtgaag aagtatgctg gtctgctttc aaacttctgt aaacttcagc ttagtggtag 360 

aacagttaga aaatccaaat atttagtttc aatgcctgag tagtccattt actgattttc 420 

accaactctg taccatcacc ttggtctaa 449 



<210> 54 
<211> 466 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 148 

<223> 99-22310-148 : polymorphic base C or T 



<220> 

<221> misc_binding 
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<222> 136. .160 

<223> 99-22310-148. probe 

<220> 

<221> primer_bind 

<222> 129. .147 

<223> 99-22310-148. mis 

<220> 

<221> primer_bind 
<222> 149. .167 

<223> 99-22310-148. mis complement 
<220> 

<221> primer_bind 
<222> 1. . 19 
<223> 99-22310. pu 

<220> 

<221> primer_bind 
<222> 446. .466 

<223> 99-22310. rp complement 
<400> 54 

gtgtgatacg agatcattct ttttttaaaa gctccctagg gtttcagaag tgcagcacag 60 
gttgaaagct accgatatag agcaaagctt tactaaaagc tgatatgtgg aagcaaagaa 120 
tgtgaaatga ctcctgaagg caccaaaytg acaatgcaaa tcaaatccaa gaacaagctc 180 
taaatgtgtg cttgtatttt actgaatccc ttttgtaaac ccatgcaata ttttacccca 240 
atattattgc aatacaaact gggaaagata agatactgga atgtaacctg catcagacag 300 
atagcaatat ctcattgatt tagagcagag cagaccaagt tttggagtag agggagtagt 360 
tcgatgactg gcagagaagc- gggtaagaat ttaaagacaa tagtagaatc attatctctc 420 
tgctaccctg ttgggacaaa gctctctaaa ataagcaact acacag 4 66 

<210> 55 
<211> 471 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 147 

<223> 99-25922-147 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 135. .159 

<223> 99-25922-147. probe 

<220> 

<221> primer_bind 

<222> 128. .146 

<223> 99-25922-147. mis 

<220> 

<221> primerjsind 
<222> 148. .166 

<223> 99-25922-147. mis complement 
<220> 

<221> primer_bind 
<222> 1..21 
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<223> 99-25922. pu 



<220> 

<221> primer_bind 

<222> 451. .471 

<223> 99-25922. rp complement 



<400> 55 

ctctaatatc aataaagcct caaacatgtt tgctgattct aggacatatt gtcaatcacg 60 

cactcacagg agaattttcc atctacctga aataatttaa atttatctta tatttctctt 120 

tgatgagaat catacaaaat ggaattkatc tgaattcttg tgtttgcaga gcacaacata 180 

tagcagaact attaaaaaaa atcaagtaca tttgtaatag ttgaaataca aaagaagttg 240 

gccaagatat ccccaaatat gctaaaaagt cctctaattg tgatgctgaa agaaactttt 300 

ctaaactatc aaaaatgaaa tgcaaattgt agttagtagg gctaaaaaaa ttactaaatt 360 

acctttttct tatctctaca aaaaacatta caaataatat tttgtcgtat gaagtgtgat 420 

caaagtatat gtaactaaaa actaaggaat catttatagt ttatcaggca g 471 

<210> 56 

<211> 499 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 121 

<223> 99-25950-121 : polymorphic base G or C 
<220> 

<221> misc_binding 

<222> 109. . 133 

<223> 99-25950-121. probe 



<220> 

<221> primer_bind 

<222> 102. . 120 

<223> 99-25950-121. mis 



<220> 

<221> primer_bind 
<222> 122. . 140 

<223> 99-25950-121. mis complement 



<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-25950. pu 



<220> 

<221> primer_bind 

<222> 479. . 499 

<223> 99-25950. rp complement 



<400> 56 

ctaccaaatc atcaactacc aaatacctca gtgcattttc tagcattaac tttatttagt 60 

accagaatta tcacagagtc taaagttatt tcacacttaa ttttgtagaa ataaatagat 120 

stgtagaaat cttccttatg gattcaatat tttgggaaag aagcctcagt gatggatatt 180 

ttgttgtgtt agaagtggct atctattgtt actttgtaat aaattatctt aaagggtgtt 240 

tattgtaagt atcacatata atccaggata ttttcctttg gaagaggcaa acccagttaa 300 

attgggaaaa agaatatagt ctgcaaatac aaacattgaa aaatgtgacc tctgcccctt 360 

ttcaactctt ataggacaaa ttcagaactc aactaaatga aattctttat attgttaaaa 420 

aatttcccct agatctcaga cagatgctga gatctgtttt gtagcattca cccacgtacc 480 
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taaatctcac tctcagcaa 

<210> 57 

<211> 484 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 373 

<223> 99-25961-376 : polymorphic base A or C 
<220> 

<221> misc_binding 

<222> 361. .385 

<223> 99-25961-376. probe 

<220> 

<221> primer_bind 

<222> 354. .372 

<223> 99-25961-376. mis 



<220> 

<221> primer_bind 
<222> 374 . . 392 

<223> 99-25961-376. mis complement 
<220> 

<221> primer_bind 
<222> 1. .20 
<223> 99-25961. pu 

<220> 

<221> primer_bind 
<222> 464 .. 484 

<223> 99-25961. rp complement 
<220> 

<221> misc_feature 
<222> 395 

<223> n=a, g, c or t 



<400> 57 

tctcccctgt 

gttggaaaag 

caaggtctgg 

tccatgaaac 

aaatggkttc 

tcccagagct 

cctgggacgg 

acccttgaat 

ggct 



gaatgggaac 
gagggaaaga 
aagggtcact 
tatatgcagt 
cttgttgaat 
tagatttccg 
ggmtacacag 
gtggcagggg 



atgtatgaat 
cagacacacc 
ttctgccttc 
tggagcctga 
ttgtgagaag 
agaaaataaa 
acaacttata 
catggcaggt 



ttgaaattat 
cctggactca 
tttcccttcc 
tggtcttccc 
cctttcccca 
ctatgcagta 
tctcnaaact 
cactgtgaga 



gttctacatt 
gagggttctg 
attctttctt 
cttaccacct 
agtctcttgc 
ggaataggat 
ctggaagacg 
ttagcttctg 



ctcctccaca 
cctgcctcac 
caaataattg 
ccccaagtga 
tttgcttgaa 
tggtgtggct 
atcattgtaa 
ttaaactgag 



<210> 58 

<211> 470 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> allele 
<222> 399 
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<223> 99-25965-399 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 387. .411 

<223> 99-25965-399. probe 

<220> 

<221> primer_bind 

<222> 380. .398 

<223> 99-25965-399. mis 

<220> 

<221> primer__bind 

<222> 400. .418 

<223> 99-25965-399. mis complement 
<220> 

<221> primer_bind 

<222> 1. .21 

<223> 99-25965. pu 

<220> 

<221> primer_bind 
<222> 451. .470 

<223> 99-25965. rp complement 
<400> 58 

ctcaaaatac atatacacca gctttaaatg ggaggccaag ataacggaaa gaaagaatga 60 
ggtttcgatt gaggatgctt agataagaaa agcgcagagc cccctttcct gggtcaagtt 120 
gcttttcagt gccagctcac ctttgtcaac ttcctaggtc aaatgtcatg ggaaaggaaa 180 
ataatgatcc aaatcagtaa ctttaaatga tagaataaat agagaaaaca attggctgag 240 
agaaagctga ctcgattgta acgtttagct actagagaag atgaaagttc aaagaactta 300 
acatgaagtg taaggagata ataagttttg ctcattgtta atttgtctca tgattctgtc 360 
aatgcgtact ggtcacgtga ctttcacaag gaaaatgcrt tattcaccac agcagatcat 420 
cttcctcatt aaaaaccaaa ggtacaatgt gggtactgtg acttttagaa 470 

<210> 59 

<211> 523 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 241 

<223> 99-25966-241 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 229. .253 

<223> 99-25966-241. probe 

<220> 

<221> primerjbind 

<222> 222. .240 

<223> 99-25966-241. mis 

<220> 

<221> primer_bind 
<222> 242. .260 

<223> 99-25966-241. mis complement 
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<220> 

<221> primer_bind 
<222> 1. .21 
<223> 99-25966. pu 

<220> 

<221> primer_bind 
<222> 503. .523 

<223> 99-25966. rp complement 



<400> 59 

caatagtgta 

gacacaggag 

tattatttaa 

aaaaatctga 

rgtcggttga 

tgttcttaaa 

tagaggtaga 

catttcagtg 

ggctggcagg 



aacagccaac 
tccagactaa 
cttctgggca 
gtcatgcttg 
ctgccactcc 
gtccacactt 
tatattggtc 
gcttcaccca 
agatcagctg 



cattaaaaac 
aawgtatgag 
ttcatattta 
atggtatgaa 
ttcctccaag 
accgctcttg 
aggtcaggca 
gtgaagattt 
cacatcatct 



acataaaata 
tatgagttca 
agtataggaa 
agtgtgaggc 
ctcatagcaa 
ctccctcatt 
agctaagctt 
gtttctcact 
tcactgactc 



agtttcataa 
ggaaaggctt 
aagtcatgtc 
atgtattgag 
atcccacttc 
ccccatgtgc 
cagtaacaca 
taggctccaa 
tea 



agatgaaggt 
tctaagaagg 
agtattcagg 
aatgttcaag 
ageaeggaac 
cttgtttatt 
tcaaccctta 
gggtatctta 



<210> 60 

<211> 493 

<212> DNA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
523 



<220> 

<221> allele 
<222> 56 

<223> 99-25967-57 : polymorphic base A or G 



<220> 

<221> misc_binding 

<222> 44 . . 68 

<223> 99-25967-57. probe 

<220> 

<221> primer_bind 

<222> 37. .55 

<223> 99-25967-57. mis 



<220> 

<221> primer_bind 
<222> 57. .75 

<223> 99-25967-57. mis complement 



<220> 

<221> primer_bind 
<222> 1. .20 
<223> 99-25967. pu 



<220> 

<221> primer_bind 
<222> 473. .493 

<223> 99-25967. rp complement 



<400> 60 

aacaagtgac atgacaaatg tacattttct ctatttggac tgttacacag agectrgatt 60 

gaaaaggacg aagagaagaa gcaggcacaa gagaacagct gggctattac tataattgga 120 

ggtgtgagaa agagaaggtc aagaactcct cccagttttc tggcaaaaac aagttaaaaa 180 
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Sfrr^nf ta a"a"ta aattaaaagc aaataaatgt gccaacgaca aagtcagaga 24 0 
gtcctagaag gacaagctcg tgagggatag gaaggtcaag ggttttattt cattgtatta 300 
agaatggaga ttttgctgag ccataccgca cagatatcta gtcagttgga aacataaatc 
taagtgaaga acaaagttct gtgctagaga taattattca agaaccattc ttaaacaaaa 
tgattatact agaaaagata tatcaattac ttttcataca aaacaaaaaa tgcgtatttt 
ggcattctga ttg ^ y 



<210> 61 
<211> 530 
<212> DNA 

<213> Homo sapiens 



360 
420 
480 
493 



<220> 

<221> allele 
<222> 200 

<223> 99-25969-200 : polymorphic base G 
<220> 

<221> misc_binding 

<222> 188. .212 

<223> 99-25969-200. probe 

<220> 

<221> primerjbind 

<222> 181. .199 

<223> 99-25969-200. mis 



<220> 

<221> primer_bind 
<222> 201. .219 

<223> 99-25969-200. mis complement 
<220> 

<221> primerjbind 
<222> 1. .21 
<223> 99-25969. pu 

<220> 

<221> primer_bind 
<222> 510. .530 

<223> 99-25969. rp complement 
<400> 61 

gttagtgact ccttctttta cacatttctg gtttcaaagc agattatcaa gtaaaatttt 60 
aaactgccac caggagattt agctaaggag taccagctga ttttgttcta atgcagtctt 120 
taaaataatg tacactgtga catattcaag tcaagtacag catttatatg ctacacagtc 180 
aagtaaactg tgtagcatak aaatcctacc acattctcca aaccagacag agggagaatc 240 
acagatgccc catgaatcac ctagaaatcc tgtcaacata taacaaatta tatgtggtag 300 
acccataaag tcaatatcaa gttaaaagca ttttttttgt cctcatgtaa aataaactct 360 
ggaaggagag aatgatcttc acatgtcaat gaaaatctga gaaaagggag gggtacagaa 420 
cagagcaaaa tcagagttca attcagaagc cataaggaag caggggaggg agccatgctt 4 80 
accaacagca aagagtgaag tcaggagtgc agaaaagcaa acaccgtagc 530 

<210> 62 
<211> 448 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 315 
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<223> 99-25972-317 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 303. .327 

<223> 99-25972-317. probe 

<220> 

<221> primer_bind 

<222> 296. .314 

<223> 99-25972-317. mis 

<220> 

<221> primer_bind 
<222> 316. .334 

<223> 99-25972-317. mis complement 
<220> 

<221> primer_bind 
<222> 1. .21 
<223> 99-25972. pu 

<220> 

<221> primer Jbind 
<222> 428. .448 

<223> 99-25972. rp complement 
<400> 62 

gacctatgag ttcattggtt cattccactt aatctagaga attcatttta cctagacaga 60 
cactaaaaaa gacacacagg tgcacacaca cttaggcata aattgtcagt ttttagccca 120 
ggtctgtcct attcccatta ttacatataa ttggattaga aatgtcattt taaaattatt 180 
aagctatagg taaggccttt ctttgctaca tatatatata atcttgactt ctttaaaatc 240 
tagcctagaa aacatttttg tttctttttc tctttttcct ggtcccttct ttagctccca 300 
agaatttcag aaaaytgttg cattttagat tattttgcca gctgcatact tttctgtcta 360 
atttgcagat gtcacttact tgataacact tcacacagct gtctaaccca agcttaaatg 420 
attttctgat ttctgaagaa gtggctac 448 

<210> 63 

<211> 544 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 143 

<223> 99-25974-143 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 131. .155 

<223> 99-25974-143. probe 

<220> 

<221> primer_bind 

<222> 124 . .142 

<223> 99-25974-143. mis 

<220> 

<221> primer_bind 
<222> 144. .162 

<223> 99-25974-143. mis complement 
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<220> 

<221> primer_bind 
<222> 1..21 
<223> 99-25974. pu 



<220> 

<221> primer_bind 
<222> 524. .544 

<223> 99-25974. rp complement 



<400> 63 



gctataatct ctccttcctt ggtagccttg gggttgttta ttttcttagg gttttaagtt 60 

atttttcttg gcctgaaaac tcatggtgcc tactgtcaga gctgcacaga acccagtgct 120 

cagccactgc gtgcctccaa gcrctgacat cctaacaaca gtctgggtct ctgaggtctt 180 

ggagcagagg gaggtgagac aaagataaca gggtctaaac tgtgcctact ggaagtctct 240 

gctctcaaat aataaaaaaa cttaaaatga acaacatttt actaaatata gcctgatcac 300 

tgcatttgca tgggggctgg attgatctgg aatgatcaga gctaacatga agctcttttc 360 

tggagatgtg ggacatcaaa gtcacatact aagaagggat gaagacattg cagtgtatcg 420 
gtctccctca ccctgtcctt ctctagcggt accccctctc tgaatctttt cttactggcc 
acaaagaaca aaactctcag tgcaatgcca tgacaaattt ctccaggaag sytaaccaaa 
tcgt 



<210> 64 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 311 

<223> 99-25977-311 : polymorphic base A or G 
<220> 

<221> miscjbinding 

<222> 299. .323 

<223> 99-25977-311. probe 

<220> 

<221> primer_bind 

<222> 292. .310 

<223> 99-25977-311. mis 



480 
540 
544 



<220> 

<221> primer_bind 
<222> 312. .330 

<223> 99-25977-311. mis complement 
<220> 

<221> primer_bind 
<222> 1. .21 
<223> 99-25977. pu 

<220> 

<221> primer_bind 
<222> 520. .540 

<223> 99-25977. rp complement 
<400> 64 

caattctctc aaaattgtct caaataaaca agtattgagc catcgtgatg aattatgttt 
gttcacaggt tattccattt aattttctct tcccacctca ttaagatgta aatagtaact 
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cagcttcata 
aaaggcagaa 
ttaggaaagc 
caagcccatg 
cggtaacgga 
aatattctgt 
agcaaaagaa 



gagaagactg 
gctgctcttc 
aacacttaag 
rttctgtgtt 
ggtggaagtc 
tgcaccctgg 
ccatttaaaa 



aggctaagag 
tactcagatt 
cactttaaat 
aattgtgctc 
agttctactt 
gaacattatc 
tatatatacc 



aaaaggattg 
tctgacttca 
agctatgaaa 
tgatgattgt 
ctctgttgtt 
ttcatccgta 
acagcttctg 



tccaggctga 
attccacaaa 
aatgcaaaat 
tctctttctt 
ttgcaaggaa 
ttttaagctt 
atacagtgtt 



aaggttagaa 
ggtcttaaaa 
ggaatctatt 
tgcagcccca 
tgttcctagg 
tttatagaca 
gaaatactac 



<210> 65 

<211> 538 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 93 

<223> 99-25979-93 : 



polymorphic base A or G 



<220> 

<221> misc_binding 

<222> 81. . 105 

<223> 99-25979-93. probe 

<220> 

<221> primer_bind 

<222> 74 . . 92 

<223> 99-25979-93. mis 



180 
240 
300 
360 
420 
480 
540 



<220> 

<221> primer_bind 

<222> 94 . .112 

<223> 99-25979-93. mis complement 
<220> 

<221> primer_bind 

<222> 1. . 19 

<223> 99-25979. pu 

<220> 

<221> primer_bind 
<222> 518. .538 

<223> 99-25979. rp complement 
<220> 

<221> misc__f eature 

<222> 412,462 

<223> n^a, g, c or t 



<400> 65 

cactacagta atcggagaca atgcagtgac aggattgact acccgtggat cagcctgaag 60 

cttcaccttt tttttcaaag tgcattttga gartatttgt gctatccatt agtaaaaatt 120 

accaaagagg agcataatgg aagcatagag aaatagtaaa aatagaaaca tccagatatt 180 

ctatgaaccc taggaaattt acacaacagt attattattc aatatattgt tgtcgtgtgg 240 

caccaacgcc agagaagaat taattgctga gcaacggccc agacaggagt tcccaagaag 300 

taggttcagg tccatgagtt gggaacttgc tgagttgctg tggatctttg cttctttaaa 360 

gtgttgtcca tggattgcgg gcatcatcag tattacctgg gatcttcgta gnaatcacag 420 

aatcttggtc cctaacccag atctgcagaa tcagttactg cnggcttcac atgataccca 480 

ctggatgtgc atgtacattc aagtttgaaa tgatggtgtg aatcaggcag cagaaatg 538 

<210> 66 
<211> 469 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 194 

<223> 99-25985-194 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 182. .206 

<223> 99-25985-194 .probe 

<220> 

<221> primerjoind 

<222> 175. .193 

<223> 99-25985-194. mis 

<220> 

<221> primer_bind 

<222> 195. .213 

<223> 99-25985-194 .mis complement 
<220> 

<221> primer_bind 

<222> 1. .21 

<223> 99-25985. pu 

<220> 

<221> primer_bind 
<222> 449. .469 

<223> 99-25985. rp complement 
<400> 66 

ctttgcttac aatcaaatga gatacaaggt atttaaacaa ctaactataa tgtttggaca 60 
tctgtgcttt taaaaataga tagactagtt caaaaaaaca caaataacaa aattcataat 120 
aagtaaacac tttgtcactt tgttaagtgt tcagtatcta taacctactt aggtattatt 180 
tcagcataat atayattatt acattaacga caaaaaaata aataccttag atctgtacaa 240 
aagtatagac tgaactagat aagacaagag aacattttta tagatttgaa ctcagagagc 300 
tctgcacgtc actttaagac atggctgaac tgtccaagaa gcatctgtga ttattggggt 360 
gtcctccctg tgaaacttaa aacatattaa tctctgacaa tcacttgcaa tcaggtaaga 420 
ttttcatcct gagctaatgt atgctgtgct gaaaacttta ctccccatt 469 

<210> 67 
<211> 553 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 398 

<223> 99-25989-398 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 386. .410 

<223> 99-25989-398. probe 

<220> 

<221> primer_bind 
<222> 379. .397 
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<223> 99-25989-398. mis 



<220> 

<221> primer_bind 
<222> 399. .417 

<223> 99-25989-398. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-25989. pu 

<220> 

<221> primer_bind 
<222> 533. .553 

<223> 99-25989. rp complement 
<400> 67 

cccatcttat cttcaaatcc ccatgttaca ctataggtgc agtaagtata aaacaaacct 60 
aattagtatt cttagataat gcagacattt aaaaagcaca agtcattttt ggaaaacgaa 120 
gaagtgcaat gccactgagt ttaccctgaa atcatgaaaa tggcttgatg gagtgctaat 180 
gtacatatgc ttaacaataa aaggtaaaat ctctcaccat cacttttggt agtcatttct 240 
ctccgacccc cacatgtaaa ctaattccag ggtaattgca gggtgatagt tttgattttc 300 
tgtattccaa gacagtaatt cttgagagct atttgcatca caactactat cagtggctac 360 
taactcaaat aacagcgata attataataa tcattacrta tttactgagt attcactatg 420 
gactttgcta accatactca agcatactgt ctttccaaga tacaaatgtt ttagaaaatg 480 
aaatttctag aggtaaggcc ctggcctcat tcagattatt tcagttcata ttgggattkr 540 
aaaacaactg tga 

<210> 68 
<211> 451 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 264 

<223> 99-26146-264 : polymorphic base G or T 
<220> 

<221> misc_binding 
<222> 252. .276 
<223> 99-26146-264 .probe 

<220> 

<221> primer Jbind 
<222> 245. .263 
<223> 99-26146-264 .mis 



553 



<220> 

<221> prime r_bind 
<222> 265. .283 

<223> 99-26146-264. mis complement 
<220> 

<221> prime r_bind 
<222> 1. .19 
<223> 99-26146. pu 

<220> 

<221> primer bind 
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<222> 436. .456 

<223> 99-26146. rp complement 
<400> 68 

ttttacaccc atcaaaacct aaaaatcctt catacatttt aatatctacc caggaaatga 60 

gtttaagtga atcaattaag aggaaatgaa tactcgactt attcagaccg aagttgttat 120 

ttgggatttg ttactagtat gtgtattttt attatgacac aactaatgca gtgatgtgtt 180 

atgaaacaaa tctgagccaa aaagcaggtt ttgaaatcaa cttcacttat aaaggcatta 240 

tcttggctac attatttaat atcktttaca tctacagatg cttatcacct taagatggga 300 

ttacttcctt ataattacac atcataaatt caaactattg taagttgaaa tgcatttagt 360 

accttagtaa acctatttta atatcaaaaa aatcccaaat tgaactatct taagttgggg 420 

accatctgta gttyyttgca tgaaaaatag g 451 

<210> 69 
<211> 454 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 276 

<223> 99-26150-276 : polymorphic base A or G 
<220> 

<221> miscjbinding 

<222> 264. .288 

<223> 99-26150-276. probe 

<220> 

<221> primer_bind 

<222> 257. .275 

<223> 99-26150-276. mis 

<220> 

<221> primer_binci 
<222> 277. .295 

<223> 99-26150-276. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-26150. pu 

<220> 

<221> primer_bind 
<222> 437. .454 

<223> 99-26150. rp complement 
<400> 69 

gaaagattag ttgcatgaga taatctgatc tctaaaattc tgccttttta atattttatc 60 
aagggatcca agaacagtta ctgcaatatt cgtaccacaa tcaactttgc aatttgaaaa 120 
gacaccagct tgttggagga gaatgaattg cgagcggaga gagtcagaga ggagataaga 180 
ctcagtttga ggaaggccta ttgagacatt tggttcatgt gacaatctgg tgttttgtac 240 
tgtggtcatt tccacggaga tggagataaa tgagcrtatt ttggacagtt tatttaacag 300 
catactaact tttcatattt tgaggtttct aaagttgaac gcatcttcta attgatgaca 360 
tgtcagtttg atacattgtt tttctctgtt aaaaacatat gctatgatac atggcatctg 420 
agatgcaaag aaatatggta cttttggtgg caca 454 

<210> 70 
<211> 449 
<212> DNA 



178 



WO 01/40493 



PCT/IB00/01507 



<213> Homo sapiens 
<220> 

<221> allele 
<222> 44 

<223> 99-26153-44 : polymorphic base A or C 
<220> 

<221> misc_binding 

<222> 32. .56 

<223> 99-26153-44 .probe 

<220> 

<221> primer_bind 

<222> 25. .43 

<223> 99-26153-44 .mis 

<220> 

<221> primer_bind 
<222> 45. .63 

<223> 99-26153-44 .mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-26153. pu 

<220> 

<221> primer_bind 
<222> 429. .449 

<223> 99-26153. rp complement 
<400> 70 

atgcttaatg cacatctacr taatctatat aaatcaatca atcmatctat gtctacctgt 60 
tcagggtata aaatctttgc atactacgtt aacatagtat acaagttcct tgtgcaacag 120 
atagtgtaca gcaattaact gctgtcaagt tttatctttc ctgtaaaact cccagcacct 180 
gcactgaact tcgctggtca ataagtgcat tatatacaca cccatatgca catacatwat 240 
ataatatttt tatatgtcaa ttgctcctct agattgagtc caatgctcag aattataacg 300 
gtaatttctc aaaaacaaag aaacggaagg aaagaaggaa gggggatgga gagaagtcag 360 
aaacagagaa tgagggaaga agaaaggaag tgctgaaaat gtatttgggg tgaataaaca 420 
catgagatgg tataaagagg caggtaatg 449 

<210> 71 
<211> 511 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 290 

<223> 99-26156-290 : polymorphic base A or C 
<220> 

<221> misc_binding 

<222> 278. .302 

<223> 99-26156-290. probe 

<220> 

<221> primer_bind 

<222> 271. .289 

<223> 99-26156-290. mis 
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<220> 

<221> primer_bind 
<222> 291. .309 

<223> 99-26156-290. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-26156. pu 

<220> 

<221> primer_bind 
<222> 491. .511 

<223> 99-26156. rp complement 
<400> 71 

gatgagtgtg aggtagtgtc tcattttggt tttgagtttt cctaatgatt actgacattg 60 
aaaatctttt tataatgtct atatttataa aattatattt atatagacaa agacccccaa 120 
aaacatattt gtctagggcc ttgtagaccc aaggggaagc cctgatggat tcgaatccta 180 
attccaccac ctattagctg tgtagtgttg aatatatctg taaacctact tgagtatcag 240 
aatttttata ttggctgcat tttaactgta gatagctgtt ctggagcaam aatactgtaa 300 
aattagtctt ttcttgaaaa acaggcagaa cctagtattg acagatataa ctattttttg 360 
cacccctctt ctatatctta tgctattgtc cacgagctta ttatattgta acttgatatt 420 
aattttaaga aagatttgaa attagccata ttttagtatg tcagttttcc ataacaagga 480 
aaaaaaaaca cagaataaag atttaagcca c 511 

<210> 72 

<211> 551 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 189 

<223> 99-27345-189 : polymorphic base G or C 
<220> 

<221> miscjainding 

<222> 177. .201 

<223> 99-27345-189. probe 

<220> 

<221> primer_bind 

<222> 170. .188 

<223> 99-27345-189. mis 

<220> 

<221> primerjDind 

<222> 190. .208 

<223> 99-27345-189. mis complement 
<220> 

<221> primerjDind 

<222> 1. .18 

<223> 99-27345. pu 

<220> 

<221> primer_bind 
<222> 531. .551 

<223> 99-27345. rp complement 
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<400> 72 

taacccacac ggatttagct agagttctag ttagctcatg agtagggaca gaactgggtt 60 

tggagttaat tttactctgt acttatgact taaaaccgga ttggaattac cttctgagta 120 

agtgttccca tattaccata tctaaattag gtgtttcatt ttttataact acggtagcat 



<210> 73 

<211> 514 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 380 

<223> 99-30329-380 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 368. .392 

<223> 99-30329-380. probe 

<220> 

<221> prime r_bincl 

<222> 361. .379 

<223> 99-30329-380. mis 



180 



cttctcagsa tcaactatat aatttttagg gcccagtgaa aactgaaact atgatgtcct 240 



300 



tcactagaaa atgattcaga attttcagga ggtgccagca gagtgttagt ccaaaagcag 

gtccttctac aggacccaga atgatggcac agcctgggaa acaataaagc tggccctgcc 360 

cccttcctca gtatttatca gaacttgtag ttgtaatttt tataaaatac gtacggtaat 420 

ttttctgttg catttgcttg ttttctggaa aactgcaagc ttcacaacgt gagtgacatg 480 

catactgtat ttcttgggtc agctcagcat ctagcacagt gcctgatgca gcagmcaccg 540 
ataaattctg t 



<220> 

<221> primer_bind 

<222> 381. .399 

<223> 99-30329-380. mis complement 
<220> 

<221> primer_bind 

<222> 1. .18 

<223> 99-30329. pu 

<220> 

<221> primer_bind 
<222> 496. .514 

<223> 99-30329. rp complement 



<400> 73 

gacatacccc aaacacagca gcacaaagca actgcctgtg attgtcaatt atctgcacta 60 

actttgaata gacaactgag agtaaaattg accattagct gttattaatc ttttaaatag 120 

cttatttatc aggacaatat tttctttgag gaatggtttc acatcgttga taaacagttt 180 

cctattacta acattaaact aaataatata tgtcatattt cttttttacc cttagaaagt 240 

taatctaata atattattct gattagtggt acttttttta atttttgtaa tgttgtttat 300 

ggtgtttgat ttgtactttt ttgtcttttt catacctttt aataacatcc gatttactac 360 

tggagcattt tattttcaty gtctcttaaa atacttaata acttttagat ttgttgttca 420 

caaaactaaa taattgcttg atgtttcatt ttatccttct tattaaatag gcagctcgtg 480 

tacatatgaa agtgtctttt tatacaccga aacc 514 

<210> 74 
<211> 511 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 342 

<223> 99-31798-344 : variable motif CAT or ACTGTCTCTACCTCA 
<220> 

<221> primer_bind 
<222> 1. .20 
<223> 99-31798. pu 

<220> 

<221> primer_bind 
<222> 491. .511 

<223> 99-31798. rp complement 
<400> 74 

caacccataa caaatggaag gcagagccca atggcaaagg tagtgccaag gcagatgtgt 60 
tccccacagc ctagaagttg gggagagagg aggcctaaaa tgcttccaga ctgggtctac 120 
catagaggaa ggtaagaagg taagatacct taagattcgg gcatggggaa ggggcagaga 180 
aggaaaattg catgatctca tactaggtga cctcagtacc agtgaagttt tgattggatt 240 
tgcgagctga tctaagccaa ctccaacatt caacaatggt aagggaaccc ttagraattc 300 
acttgtaggt aagggctaca tatattcttc attacaacct gactgtctct acctcattat 360 
gcatttatat ccttattata gcagaggtag ttatgattaa agtaactatc agactgaaat 420 
cctcccaaag ggtaaacatt attagcaagc aggaaggaaa caagatatgt ctaggagaga 480 
tttttaagtg cagaaatatc tccactccgt c 511 

<210> 75 

<211> 474 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 159 

<223> 99-5873-159 : polymorphic base C or T 
<220> 

<221> miscjDinding 

<222> 147. .171 

<223> 99-5873-159. probe 

<220> 

<221> primer_bind 

<222> 140. . 158 

<223> 99-5873-159. mis 

<220> 

<221> primer_bind 
<222> 160. .178 

<223> 99-5873-159. mis complement 
<220> 

<221> primer_bind 
<222> 11. .28 
<223> 99-5873. pu 

<220> 

<221> primer_bind 
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<222> 466. .474 

<223> 99-5873. rp complement 



<400> 75 

aaaggagcag gcgtaacaat aagcagggtt agtcgccaca aaacttgaga taagaggaaa 60 

actaaaaaag tctaatgaaa tcagtagtct taaaaagatg acatgatagg aagagaagtg 120 

ttaaaaaaga aaaaaaatag gtatgaaaga gagtaacaya taccggaaaa gggataaaat 180 

acatcctttg aaagaacaaa gagttattca aattgaattc ttaatgaatt acttaaacag 240 

cagattagat attgttaaaa agaggaatag ggaattaaat gatatatgtg atgatattac 300 

ctagtgtaac catcaaagat gtattgcaaa tgataaagaa aaaaatgctg ccatggcaat 360 

attaatatca taaaaatata ctttaagaag taaataaatg caactaggaa tagagaaagc 420 

taatgaataa taatatttaa aaaaaagtat aacaagtata cataagatgt aata 474 

<210> 76 
<211> 490 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 220 

<223> 99-6012-220 : polymorphic base G or T 



<220> 

<221> miscjbinding 

<222> 208. .232 

<223> 99-6012-220. probe 

<220> 

<221> primerjDind 

<222> 201. .219 

<223> 99-6012-220. mis 



<220> 

<221> primer_bind 
<222> 221. .239 

<223> 99-6012-220. mis complement 



<220> 

<221> primer_bind 
<222> 11. .29 
<223> 99-6012. pu 



<220> 

<221> primer_bind 

<222> 477. .490 

<223> 99-6012. rp complement 



<400> 76 

attgtattta gtcttgactt gttttcctga gggtccaggt tgatttgcat gctcttgagg 60 

aaatatacac gtcttctcag ttttaataat tgactgacag ccctgtggtt tctcaggacc 120 

cagtgagctg ctgctcccag gtcagtctgc aaaggatgct ggttcccttg tggtctcatc 180 

aaggtgagga atttcctgat tttagagatt tctttatcck aattttgaag actttctttc 240 

acatttctag gcataaaaaa atgtacagca ctctactgct tgtttaacaa atggatagtg 300 

atatatctgc caacaaagac cacatggagt atttcattga ctatcagaga agtttcctcg 360 

aaaggcacca tacttagtgt tttatttcca tgagtgaagg aaaattagtt atttgaagta 420 

tttggctgtc tttagttgtt tctaaagtag tgctgatttt atatgcccat aatattcata 480 

tatacaccca 4 90 

<210> 77 
<211> 529 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 99 

<223> 99-6080-99 : polymorphic base C or T 
<220> 

<221> miscjbinding 

<222> 87. .111 

<223> 99-6080-99. probe 

<220> 

<221> primer_bind 

<222> 80. . 98 

<223> 99-6080-99. mis 

<220> 

<221> primer_bind 
<222> 100. .118 

<223> 99-6080-99. mis complement 
<220> 

<221> primer_bind 
<222> 11. .28 
<223> 99-6080. pu 

<220> 

<221> primerjDind 

<222> 519. .529 

<223> 99-6080. rp complement 

<400> 77 

acaacaacaa aaatgtgtcc ctgaaaccca tgctatattc aactgaatat tctaatgtct 60 

ttgattacaa agccatctct agcaatttaa tacaattayg aaatggaaaa gttggcaaat 120 

gcaaaacaat agctcgtgtt caaggtatgt ctttattagg ggaagtttat cgaaacagat 180 

gtttatgcta tttcctataa actagattct aaaatatttt attctataaa gatgtattga 240 

ctttatatga aaaaattatt gaaaaatcta caagatggtg aaactcttta gaactatatt 300 

tctattacaa gtttattttt aatttcaaaa atgtactgca taaatgcagc aaaaccttta 360 

ttgtcacata ttaaaacatg tacattattg tgtgcaaatt aaaatttcat taccttaaac 420 

caaaaagtga gttggccaga tagtaaataa tttaggctct aaggctgaaa agcgcttgta 480 

ttaattactc aactccacca ctattttgcc aaagcagtca cagacaata 529 

<210> 78 
<211> 489 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 156 

<223> 99-7308-157 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 144. .168 

<223> 99-7308-157. probe 

<220> 

<221> primer__bind 
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<222> 137. .155 

<223> 99-7308-157. mis 

<220> 

<221> primerjbind 
<222> 157. .175 

<223> 99-7308-157. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-7308. pu 

<220> 

<221> primer_bind 

<222> 469. .489 

<223> 99-7308. rp complement 

<400> 78 

tgtggtctgg atatggtgra ctgtccttca cacacagatg tgggaagcca tgatcatcag 60 

ttgcattatt cctgaggggc aatgcattcc agttacatag aaccagtttc tacgtttcag 120 

ggtatatgta ttcatggtga caaatttatt cacatyttaa gtaattttaa gtaattcaca 180 

ttttaagtaa ttttcctgaa tgtgcctcat tggcttctgt gcctcttcag aaaagatgaa 240 

ctaaacactg gcatatgtgt tcagatttca acattccgtt gttttcattg tggataattt 300 

ctgtcccata tttttgtgta aagttagaca ataaagtgtt aatattctgg cgtcggcaca 360 

ttttctttcc tgataaataa caattcacat atctttttaa aatatcagag aatatagtaa 420 

ccaatttcca attctttttt caccatgtat ctattggagt tttaaaatga ctaatactaa 480 

ggcaactat 489 

<210> 79 
<211> 112 
<212> DNA 

<213> Pan troglodytes 
<400> 79 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctcg cttctaattc ct 112 

<210> 80 
<211> 118 
<212> DNA 

<213> Pan troglodytes 
<400> 80 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga attataaaga 60 
accatctcag aattttactc tgttactgca aaatctcatg caaagataac caagtatg 118 

<210> 81 
<211> 146 
<212> DNA 

<213> Pan troglodytes 
<400> 81 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 60 
ttgccatgaa gacttttcca acatggctgc ttacttctta aagcctgtaa ggagatccag 120 
tctcagttgg ccaagtcttg taaaag 146 

<210> 82 
<211> 94 
<212> DNA 

<213> Pan troglodytes 
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<400> 82 

gaaaataacg catttataaa acccagctca gatgttactt ccaaccaaca aatttcctgg 60 
acacactaag taatagagtt aatcacactg tcct 94 

<210> 83 
<211> 229 
<212> DNA 

<213> Pan troglodytes 
<400> 83 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 60 
ttgccatgaa gacttttcca acatggctgc ttacttctta aagcctgtaa ggagatccag 120 
tctcagttgg ccaagtcttg taaaaggtaa cctcatcacc ggagtcacgt ctcatcacct 180 
ttcccatgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 84 
<211> 205 
<212> DNA 

<213> Pan troglodytes 
<400> 84 

tgaaggtctc cctgctgtgc ttagagaggg ggatgctgga agaagagtgg tcagagaagc 60 
aacattgctg gctctaaaga cagaggacag gccacgagcc aaggcatgca ggaggcctcc 120 
agaagctggg gcaggcaaga gaatggactc agccctggag cctccagaag gcacgcagcc 180 
ctgcccattc cttgctttta gccag 205 

<210> 85 
<211> 82 
<212> DNA 

<213> Pan troglodytes 
<400> 85 

aaacaatacg aacagaaaca aagctatgta cttgttttgg gaagcaacct caaatggaaa 60 
gatgggggct gtgttctaaa ag 82 

<210> 86 
<211> 234 
<212> DNA 

<213> Pan troglodytes 



<400> 86 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 

atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc attttgggtc ctcccagcta aatcatcggc attgtgaagc agagatgaga 180 

catcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac acta 234 

<210> 87 
<211> 714 
<212> DNA 

<213> Pan troglodytes 
<400> 87 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 

atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc attttgggtc ctcccagcta aatcatcggc attgtgaagc agagatgaga 180 

catcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac actagtgagc 240 

ttttacttta agccactaat ttggggtgta gctcttttgc agcaaaagat aatcagaaca 300 

ctgccatgca ggagctgaga aacttgctca ctgttatgcg tgtatccaat gactactcac 360 

acacacatta cttgcttcac tctgctttct ggctttccat tcagcaaatt ttaagaaatg 420 

aaaaagttat tcatgggaaa ttaactcaag atgaatattg ttgtagttaa tctctttgta 480 
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aatggtaaaa tattaaatta ttgtgagtca tttttattta aagaagtttg gttctttaaa 540 

tgaaaaaagt ttgggtcttt tgtggatact ttgcatatag acattctaga catgaaagaa 600 

tgttgaagac aaagaatgtt gaaggaatat agacatgaaa gaatgttgaa gtttggtttt 660 

ttgtggatac tctgcatata gacattctag actggagtga tacatcttgt tcag 714 

<210> 88 
<211> 220 
<212> DNA 

<213> Pan troglodytes 
<400> 88 

acatgaaaga atgttgaagt ttggtttttt gtggatactc tgcatataga cattctagac 60 
tggagtgata catcttgttc aggtttctat gttttcaaaa tgataacttg acacacctac 120 
caatgccaat ttttaggata tatatctgta tatctgtgtg aactaaggaa aattcaggta 180 
tgattaaacc tacatatttt gccaggagag actggataat 220 

<210> 89 
<211> 173 
<212> DNA 

<213> Pan troglodytes 
<220> 

<221> misc_feature 
<222> 91. .92, 131 
<223> n=a, g, c or t 

<400> 89 

ggtcctgatc ttcccagtga cttagcagca cagtaatttg gctgaaatgt tgatgagttc 60 

agacaagaag aaggtgatga tgacaagaga nncgtcacgg ggaaattctg ccctgaggaa 120 

ggaatagttt naggctggag agatgtgcta cccccgccac tctttctcaa atg 173 

<210> 90 
<211> 1105 
<212> DNA 

<213> Pan troglodytes 
<220> 

<221> misc_feature 
<222> 201,744,803, 808. .809 
<223> n=a, g, c or t 

<400> 90 

gaaagacaac ctgtaggcac agaaatcatt gctctgctta cactgaaatc tcctgctgac 
tttgagagct gagcactgga ttttgatggg agagtgaaac ttagtgaagg aatgatttga 
acctttgttg atctaatact tctgtgatac agttatctaa acagaatgag tctaaagtca 
gtataaagac ctgtgtctct ncatgttttg ctacttataa ttgcagagga acattttgag 
atagaaaatt ttatccccct ttcaagtgac tttttataaa acataacctt ttaagagagg 
tcattctttt gaaaaattgt ctgaatacaa gagaagtaac tgaacagcaa gggattgtag 
ccataatgtt cttgaccact ttagattgac aaatagtgtc tcaaaattga tcaggttctg 
gagaatctat tgaggttgta tggagtaacc aagtgatgga agaggttgga agtgaataat 
tttatgtggt attatgtgat tttttttggt gggggtagtg tctggagctg agcatgatgg 
ggcctagatg gggcagcctt ctaatgctcc cagtatttcg caaatcatat gtcacgtcaa 
aattgtaatg acaaagctca cagcagctgc ctacgccagg tttggtgcca gctctaatta 
gagcatcgat catttctctc cccttcatga atgcacgtgc gtgcacattt tcatatacaa 
gggcaaagga catttcaaag tttnggatgg aatctttcct acttagaagt aataaaaagc 
aatactttga aatcttacca aanattannc aaatagcaac aatttttcca tttaaagatg 
tgtgtcttca ggtctgttgg atctttcaaa tttctatttt ggacatggaa actacagatg 
aaatttaact tactcaaagt tacacagaac tcctggctga tctcttttac ttaattcttc 
taaaatttta agtaagaata attttgaaat gttattggta atgcttaaat aacaaaagtt 
actttaatat gttacatgtt ggaagtttca tttatactta atatctaatt taactcttgt 
gaatatatta ttcagttttt ttttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1105 
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<210> 91 
<211> 114 
<212> DNA 

<213> Hylobates sp. 
<220> 

<221> misc__feature 

<222> 45,75 

<223> n=a, g, c or t 



<400> 91 

tcccagtgag attcaaccac tcatagactg cttttgtcca aactngcgtt caaataaggc 60 
aaacgccgag ctggnaaccc atctcactgt ttctgtacct cacttctgat tcct 114 

<210> 92 
<211> 117 
<212> DNA 

<213> Hylobates sp. 
<400> 92 

ttttagaaaa caaatctcct aagagaatga atagaaatta tcttaaggaa ttataaagaa 60 
ccatctcaga attttactct gttactgcaa aatctcatgc aaagatagcc aagtatg 117 

<210> 93 
<211> 146 
<212> DNA 

<213> Hylobates sp. 
<400> 93 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggccac tgatacttcc 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaagtcttg tgtaag 146 

<210> 94 
<211> 229 
<212> DNA 

<213> Hylobates sp. 
<400> 94 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggccac tgatacttcc 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaagtcttg tgtaaggtaa cctcatcacc ggagtcacgt ctcatcacct 180 
ttcccaagtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 95 
<211> 205 
<212> DNA 

<213> Hylobates sp. 
<220> 

<221> misc_feature 
<222> 187 

<223> n=a, g, c or t 
<400> 95 

tgaagttctc cctgctgtgc tcagagaggg gggtgctgga agaagagtgg tcagagaagc 60 
aacattgctg gctctaaaga gagaggacag gccacgagcc aaggcatgca ggaggcctcc 120 
agaagctggg gcaggcaaga gaatggactc ggccctgcag cctccagaag gcatgcagcc 180 
ctgcccnttc cttgctttta gccag 205 
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<210> 96 
<211> 234 
<212> DNA 

<213> Hylobates sp. 



<400> 96 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cagattgcac tgaggccacc 60 

atgctgtaag aaacctcaag ctaccaacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc cttttgggtc ctcccggcta aatcataggc attgtgaagc agagatgaga 180 

cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 97 
<211> 715 
<212> DNA 

<213> Hylobates sp. 



<400> 97 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cagattgcac tgaggccacc 
atgctgtaag aaacctcaag ctaccaacct gaaaagctgg agagaatcag cacccatcag 
cttttccagc cttttgggtc ctcccggcta aatcataggc attgtgaagc agagatgaga 
cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aataggaagc 
ttttacttta agccactaat ttggggtgta gttcttatgc agcaaaagat aatcagaaca 
ctgccatgca ggagctgata aacttgctta ctattatgtg tatatccaat gactactcac 
acacacattt cttgcttcac tctactttct ggctttccat tcagcaaatt ttaagaaatg 
aaaatgttat tcatgggaaa ttaactcaag atgtatattg ttgtagttaa actctttgta 
aatggtaaaa tattaaatta ttgctaatca tctttattta aagaagtttg gttctttaaa 
tgaaagaagt ttgggtcttt tgtggatact ctgcatatag acattctaga catgaaagaa 
tgttgaagac aaagaatgtg gaaggcatat agacatgaaa gaatgttgaa gtttggttct 



<210> 98 
<211> 213 
<212> DNA 

<213> Hylobates sp. 
<400> 98 

acatgaaaga atgttgaagt ttggttcttt tgtggatact ctgcatatag acattctaga 60 
ctggagtgat atatcttgtt caggtttcta tgtttttaaa atgataactt gacacaccta 120 
ccaatgcaaa tttttaggat atatatctgt gtgaactaag gaaaattcag gaatgattaa 180 
acctacatat tttctcagga gagactggat aat 213 

<210> 99 
<211> 209 
<212> DNA 

<213> Hylobates sp. 
<220> 

<221> misc__f eature 

<222> 26. .27 

<223> n=a, g, c or t 



<400> 99 

ggttttgtgc atttctaaga tgatcnnctt ggtgtggggt cctgatcttc ccagtgactt 60 

agcagcacag taatttggct gaaatgttga tgagttaaga caagaagaag gtgacgctga 120 

caagagagcg tcaaggggaa attctgccct gaggaaggaa tagtttgagg ctggagagat 180 

gtgctacccc cgccactctt tctcaaatg 209 

<210> 100 
<211> 1415 
<212> DNA 

<213> Hylobates sp. 
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<220> 

<221> misc_feature 

<222> 343,383,387,570,1118 

<223> n=a, g, c or t 

<400> 100 

gaaagaaaac ctgtaggcac agaaatcatt gctctgctta cactgaaatc tcctgctgac 60 

tttgagaggt gagtgttgga ttttgatgga agggtgaaac ttagtgaaga aatgatttga 120 

acctttgttg atcgaatact tctgtgatac agttatctaa acagaatgag tctaaggtca 180 

gtataaagac ctgtgtctct tcgtgttttg ctacttataa ttgcagagga acgttttgag 240 

atagaaaatt ttatcccact ttcaagtgac tttttataaa acataacctt ttaagagagt 300 

tcattctttt gaaaaattgt ctgaatacaa gaaaagtaac tgnaacagca aggattgtag 360 

ccataatgtt cttgaccact ttnagantga caaatagtgt ctcaaaattg atcaggtgcc 420 

acagaatcta tagaggttgt atggagtaac caactgatgg aagaggttgg acacatctga 480 

ataattttat gtggtattat gtgatgttat tttggtgggg tagtgtctgg agctgagcat 540 

gatggggcct agatggggca gccttctaan gctcccaaca tttcccaaat catatgtcat 600 

ctcaaaattg taatgacaaa gctcacagca gctgcctaca ccaggtttgg tgccagctct 660 

aattagagca tcgagcattt ctctcccctt catgaatgca catgcgtgca cattttcata 720 

tatgagggca aaggacattt caaagtttgg ttggaatctt tcctacttag aagtaataaa 780 

aacaatactt tgaaatctta ccaaatatta aacaaatagc aacaattttt ccatttaaag 840 

gtgtgtgtct tcaggtctgt tggatctgtc aaatttctat tttggacatg gaaaccacag 900 

atgaaattta acttacacaa agttacacag aactcctggc tgatctcttt tacttaattc 960 

ttctaaaacc taaagtaaga ataattttga aattttattg gtaatgctta aataacagtt 1020 

actttaatat attacatgtt tgaagatttt catttatact taatatcgaa tttaacttgt 1080 

gaaaatatta ttcagtattt tgttttgttt tgttttcnaa aatgacttgt tcagggctcc 1140 

ttttgggatt ttaagtgaga aaaactctat ttcataatat tgttaccatg ctcatccaat 1200 

aacacttcat agatggttag ccatctctaa aattaaattt tgtagttgag ggccattgaa 1260 

aataccacct atattgggat ttctgtttaa cacttcaaaa tgtatcatta tcataattta 1320 

aaattagatt taaatacaag cctctttata gaaaacctga taaatatggc tttatttgtt 1380 

tgcttgtttt ttttgcaggg agggatattc ccttg 1415 



<210> 101 
<211> 112 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 
<222> 99 

<223> n=a, g, c or t 
<400> 101 

tcccagtagg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctng cttctaattc ct H2 

<210> 102 
<211> 118 
<212> DNA 

<213> Gorilla gorilla 
<400> 102 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga gttataaaga 60 
accatctcag aattttactc tcttactgca aaatctcatg caaagataac caagtacg 118 

<210> 103 
<211> 146 
<212> DNA 

<213> Gorilla gorilla 
<400> 103 
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ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc ' 60 
ttgccatgaa gacttttcca acatgactgc ttacttcttc aagcctgtaa ggagatccag 120 
tctcagtcgg ccaagtcttg taaaag 146 

<210> 104 
<211> 94 
<212> DNA 

<213> Gorilla gorilla 
<400> 104 

gaaaataacg catttataaa acccagctca gatgttactt ccaaccaaca aatttcctgg 60 
acacactaag taatagagtt aatcacaccg tcct 94 

<210> 105 
<211> 229 
<212> DNA 

<213> Gorilla gorilla 



<400> 105 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 

ttgccatgaa gacttttcca acatgactgc ttacttcttc aagcctgtaa ggagatccag 

tctcagtcgg ccaagtcttg taaaaggtaa cctcatcacc ggagtcacgt ctcatcacct 

ttcccatgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 

<210> 106 
<211> 205 
<212> DNA 

<213> Gorilla gorilla 
<400> 106 

tgaaggtctc cctgctgtgc ttagagaggg ggacgctgga ataagagtgg tcagagaagc 
aacattgctg gctctaaaga cagaggacag gccacgagcc aaggcatgca ggaggcctcc 
agaagctggg gcaggcaaga gaatggactc agccctggag cctccagaag gcacacagcc 
ctgcccgttc cttgctttta gccag 

<210> 107 
<211> 82 
<212> DNA 

<213> Gorilla gorilla 
<400> 107 

aaacaatacc aacagaaaca aagctatgta cttgttttgg gaagcaacat caaatggaaa 
gacgggggct gtgttctaaa ag 

<210> 108 
<211> 234 
<212> DNA 

<213> Gorilla gorilla 



60 
120 
180 
229 



60 
120 
180 
205 



60 
82 



<220> 

<221> misc_feature 
<222> 192 

<223> n=a, g, c or t 



<400> 108 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 

atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc attttgggtt ctcccagcta aatcattggc attgtgaagc agagatgaga 180 

cctcctgcca tnccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 109 
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<211> 714 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 
<222> 192 

<223> n=a, g, c or t 



<400> 109 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 

atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc attttgggtt ctcccagcta aatcattggc attgtgaagc agagatgaga 180 

cctcctgcca tnccctttcc aaatcctgga ccaacagaat catcatatac aatagtaagc 240 

ttttacttta agccactaat ttggggtgta gctcttttgc agcaaaagat aatcagaaca 300 

ctgccatgca ggagctgata aacttgctca ctgttatgta tgtgtccaat gactactcac 360 

acacacatta cttgcttcac tctgctttct ggctttccat tcagcaaatt tgaagaaatg 420 

aaaaagttat tcatgggaaa ttaactcaag atgaatattg ttgtagttaa tctctttgta 480 

aatggtaaaa tattaaatta ttgtgagtag tttttattta aagaagtttg gttctttaaa 540 

tgaaagaagt ttgggtcttt tgtggatacg ttgcatatag acattctaga catgaaagaa 600 

tgttgaagac aaagaatgtt gaaggaatat agacatgaaa gaatgttgaa gtttgttttt 660 

ttgtggatac tctgcatata gacattctag actggagtga tacatcttgt tcag 714 

<210> 110 
<211> 220 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 
<222> 122 

<223> n=a, g, c or t 



<400> 110 

acatgaaaga atgttgaagt ttgttttttt gtggatactc tgcatataga cattctagac 60 

tggagtgata catcttgttc aggtttctat gtttttaaaa tgataacttg acacacctac 120 

cnatgccaat ttttagggta tatatctgta tatctatgtg aactaaggaa aattcaggaa 180 

tgattaaacc tacatatttt gccaggagag accggataat 220 

<210> 111 
<211> 209 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 

<222> 97,115,117,130,134,160,193 

<223> n=a, g, c or t 



<400> 111 

ggttttgtgc atttctaaga tgatcatctt ggtgtggggt cctgatcttc ccagtgactt 60 

agcagcacag taatttggct gaaatgttga tgagttnaga caagaagaag gtgangntga 120 

caagagagcn tcanggggaa attctgccct gaggaaggan tagtttgagg ctggagagat 180 

gtgctacccc cgncactctt tctcaaatg 209 

<210> 112 
<211> 1105 
<212> DNA 

<213> Gorilla gorilla 



<220> 
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<221> misc_feature 
<222> 959 

<223> n=a, g, c or t 



<400> 112 

gaaagaaaac ctgtaggcac agaaatcatt gctctgctta tactgaaatc tcctgctgac 60 
tttgagagct gagcattgga ttttgatggg agagtgaaac ttagtgaaga aatgatttga 120 
acctttgttg atctaatact tctgtgatac agttatctaa acagaatgag tctaaagtca 180 
gtataaagac ctgtgtctct ccatgttttg ctacttataa ttgcagagga acattttgag 240 
atagaaaatt ttatccccct ttcaagtgac tttttataaa gcgtaacctt ttaagagagg 300 
tcattctttt gaaaaattgt ctgaatacaa gagaagtaac tgaacagcaa gggattgtag 360 
ccataatgtt cttgaccact ttagattgac aaatagtgtc tcaaaattga tcaggttccg 420 
gagaatctat tgaggttgta tggaataacc aagtgatgga agaggttgga agtgaataat 480 
tttatgtggt attatgtgat gtttttttgg tgggggtagt gtctggagct gagcgtgatg 540 
ggccctagat ggggcagcct tctaatgctc ccagcatttc gcaaatcata tgtcacgtca 600 
aaattgtaat gacaaagctc acagcagctg cctacgccag gtttggtgcc agctctaatt 660 
agagcatcga tcatttctca cgccttcatg aatgcacatg tgtgcacatt ttcatatata 720 
agggcaaagg acatttcaaa gtttggatgg aatctttcct acttagaagt aataaaaaac 
aatactttga aatcttacca aatattaaat aaatagcaac aatttttcca tttaaagatg 
tgtgtcttaa ggtctgttgg atctttcaaa tttctatttt ggacatggaa actacagatg 
aaatttaact gacacaaagt tacacagaac tcctggctga tctgttttac ttaattctnc 
taaaatctaa agtaagaata attttgaaat tttattggta atgcttaaat aacaaaagtt 
actttaatat gttacatgtt tgaagtttca tttatactta atatctaatt taactcttgt 
gaaaatatta ttcagtattt ttttt 1105 

<210> 113 
<211> 112 
<212> DNA 

<213> Pongo pygmaeus 



780 
840 
900 
960 
1020 
1080 



<220> 

<221> misc_f eature 
<222> 69. .71,99, 106 
<223> n=a, g, c or t 



<400> 113 

tcccagtggg attcaaccac tcatagactg cttttgttcg aactgcgttc aaataaggca 60 
aacgccgann nggaaccaat ctcactgttt ctgtacctna cttatnattc ct 112 

<210> 114 
<211> 118 
<212> DNA 

<213> Pongo pygmaeus 
<400> 114 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga attataaaga 60 
accgtctcag aattttcctc tgttactgca aaatctcatg caaagatgac caagtatg 118 

<210> 115 
<211> 146 
<212> DNA 

<213> Pongo pygmaeus 
<400> 115 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggtcac tcatagttca 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaaatcttg tataag !46 

<210> 116 
<211> 94 
<212> DNA 
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<213> Pongo pygmaeus 
<400> 116 

ggaaataacg catttataaa acccagctca gatattactt ccaaccaaca aattgcctgq 
acacactaag taatagagtt aatcacacca tctt 

<210> 117 
<211> 229 
<212> DNA 

<213> Pongo pygmaeus 
<400> 117 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggtcac tcatagttca 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaaatcttg tataaggtaa cctcatcacc ggagtcatgt ctcatcacct 180 
ttcccgtgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 118 
<211> 234 
<212> DNA 

<213> Pongo pygmaeus 
<400> 118 

gaacattcac tctgagagct ctgtgctgcc atgcaaatgt cagattgcac taaggccgtc 60 
atgctgtaag aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 
cttttccagc cctttgggtc ctcccagcta aatcataggt attgtgaagc agagatgaga 180 
cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 119 
<211> 599 
<212> DNA 

<213> Pongo pygmaeus 
<400> 119 

gaacattcac tctgagagct ctgtgctgcc atgcaaatgt cagattgcac taaggccgtc 60 

atgctgtaag aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc cctttgggtc ctcccagcta aatcataggt attgtgaagc agagatgaga 180 

cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aatagtaagc 240 

ttttacttta agccactaat ttggggtgta gttcttttgc agcaaaagat aatcagaaca 300 

ctgccatgca ggagctgata aacttgttca ctgttatgta tgtatccaat gactactcac 360 

acacacatta cttgcttcac tctactttct ggctttccat tcagcaaatt ttaagaaatg 420 

aaaatgttat tcatgggaaa ttaactcaag atgaatattg ttgtagttaa tctctttgtc 480 

aatggtaaaa tattaaatta ttgtgagtca cttttattta aagaagtttg gttctttaaa 540 

tgaaagaagt ttgggtcttt tgtggatact ctgcatatag accttctaga ctggagtga 599 

<210> 120 
<211> 198 
<212> DNA 

<213> Pongo pygmaeus 
<400> 120 

gaagtttggg tcttttgtgg atactctgca tatagacctt ctagactgga gtgatatatc 60 
ttgttcaggt ttctatgttt ttaaaatgat aacctgacac acctaccatt gccaattttt 120 
aggatatata tctgtgtgaa ctaaggaaag ttcaggaatg attaaaccta catattttgc 180 
caggagagac tggataat * 19Q 



<210> 121 
<211> 152 
<212> DNA 

<213> Pongo pygmaeus 
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<220> 

<221> misc_feature 
<222> 73 

<223> n=a, g, c or t 



<400> 121 



<210> 122 
<211> 119 
<212> DNA 

<213> Macaca mulatta 



60 



tgacttagca gcacagtaat ttggctgaaa tgttgatgag ttcagacaag aaggtgatga 
tgacaagaca gcntcacggg gaaattctgc cctgaggaag gaatagtttg aggctggaga 120 
gatgtgctac ccccgccact ctttgtcaaa tg 



<220> 

<221> misc_feature 
<222> 70,81,83,85,88,93 
<223> n=a, g, c or t 



<400> 122 



tgtcagaaaa caaaaactcc taagagaatt aatagaaatt atcttaagga attataaaga 
acccatctcn cagaaattta ntntnttnac ggnaaaatct catgcaaaga taaccaagt 



<210> 123 
<211> 146 
<212>* DNA 

<213> Macaca mulatta 



60 
119 



<400> 123 

tagtacgtcc acattcattc tagatgctgc ctgaagtcct tagatgcctc tcatagttcc 60 

ttgccatgag cacttttcca acatggctga tgactgcttc aagcctgtaa ggagatctgg 120 

tctcagttgg ccaagtcttg gataat x \ s 

<210> 124 
<211> 74 
<212> DNA 

<213> Macaca mulatta 
<220> 

<221> misc_feature 
<222> 6,37,48,56 
<223> n=a, g, c or t 

<400> 124 

ctcagnatat tacctccaac ctacaaattt cctgggnaca cactaagnta ataagnagtt 60 
aatcacaccg tcct * y T\ 

<210> 125 
<211> 229 
<212> DNA 

<213> Macaca mulatta 



<400> 125 

tagtacgtcc acattcattc tagatgctgc ctgaagtcct tagatgcctc tcatagttcc 60 

ttgccatgag cacttttcca acatggctga tgactgcttc aagcctgtaa ggagatctgg 120 

tctcagttgg ccaagtcttg gataatgtaa cctcatcacc ggagtcacgt ctcatcacct 180 

ttcccatgtt ccattggtta gaagtcccag cttctgccca cattcaaga 229 

<210> 126 
<211> 202 
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<212> DNA 

<213> Macaca mulatta 
<220> 

<221> misc_f eature 
<222> 127. .128,130. .131 
<223> n=a, g, c or t 



<400> 126 

tgaaggtctc cctgctgtgc tgagagaggg gcatgctgga agaagagtgg tcagagaagc 60 
aacattgctg gctctaaaga cagaggacag gccatgaacc aaggcatgca ggcggcctcc 120 
gaaagcnngn ncaggcaaga gaatggactc tgccctggag cctccagaac gcgtgcagcc 180 
ctgcccattc cttgctttta gc 



<210> 127 
<211> 82 
<212> DNA 

<213> Macaca mulatta 
<220> 

<221> misc_feature 
<222> 6,8,30,52,72 
<223> n=a, g, c or t 

<400> 127 



aaacantncc aacagaaaca aagctttgtn cttgttttgg gaagcaacct cnaatggtaa 
gatgggggct gngttctaaa ag yy 



<210> 128 
<211> 243 
<212> DNA 

<213> Macaca mulatta 
<400> 128 



ata 

<210> 129 
<211> 727 
<212> DNA 

<213> Macaca mulatta 



202 



60 
82 



gaacattcac cgtggctgct ctgcgctgcc atacaatgtc agattgcact gaggccgcca 60 

tgctgtaaga aaccccaagc tgcctgcctg aaaagactgg agagacacag cacccatcag 120 

cttttccagc cttttgggtc ctcccagcta aattacaggc attgtgaagc agagatgaga 180 

agagatgaga cctcctgcgt gccctttcca aatcccggct cagcagaatc atcctacaca 240 



243 



<400> 129 

gaacattcac cgtggctgct ctgcgctgcc atacaatgtc agattgcact gaggccgcca 60 

tgctgtaaga aaccccaagc tgcctgcctg aaaagactgg agagacacag cacccatcag 120 

cttttccagc cttttgggtc ctcccagcta aattacaggc attgtgaagc agagatgaga 



180 



agagatgaga cctcctgcgt gccctttcca aatcccggct cagcagaatc atcctacaca 24 0 
atagtaagtt tttacttgaa gccactaatt tggggagtag ttcttatgca gcaaaagata 300 
atcagaacac tgcgccatgg caggagctga taaacctgct cactgttacg cgtgtatcca 360 
atgactactc acacacacat gacttgcttc actctacttt ctggctttcc actcagcaaa 420 
ttttaagaaa tgagaatgtt attcatgaga aattaactca agatgaatat tgttatcgtt 480 
aaactcttcg taaatggtaa aatactacat tattgctagt catttttatt taaagaagtt 
tggttcttta aatgaaagaa gtttgggtct tttgtggata ctctgcatat agatattcta 
gacatgaaag aacgttgaag acaaagaatg ttgaaggcat atagacatga aggaattttg 
aagttgggtt cttttgtgga tactctgcat atagacattc tagactggag tgatatgtct 720 
tgttcaa 



540 
600 
660 



727 



<210> 130 
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<211> 214 
<212> DNA 

<213> Macaca mulatta 
<400> 130 

acatgaagga attttgaagt tgggttcttt tgtggatact ctgcatatag acattctaga 60 
ctggagtgat atgtcttgtt caagcttcta tgtttttaaa atgataactt gacacacc?a 120 
cccaatgcca atttttagga cataaatctg tgtgaactaa ggaaaattca ggaatgaS Ho 
aacctacata ttttcccaag agagactgga taac «gatta 180 



<210> 131 
<211> 77 
<212> DNA 

<213> Macaca mulatta 
<400> 131 



60 
120 
180 
240 



tliltlZt TtltT tC tgCCCt ^^ "99-tagt ttgaggctgg agagatgtgc 60 

<210> 132 
<211> 306 
<212> DNA 

<213> Macaca mulatta 
<400> 132 

lttl^t Ct tgtt <; agggc tccttttggg attttaagtg agaaaaactc tatttcataa 
ttJSJSt III T aataacactt catagatggt tagccatctc taaaattaaa 
aaa^ca"?^ ? gaaaatacca cct.tatttg gatttctgtt taacacttca 

aaatccatta ttatcataat ttaaaattag atttaaatac aaacctcttt atagaaaagc 240 
elicit 9 93ttttattt g tttgcttgt ttttttgcag ggagggatat tcccttga^ 300 

306 

<210> 133 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> sequencing oligonucleotide PrimerPU 
<400> 133 

tgtaaaacga eggecagt 

1 8 

<210> 134 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> sequencing oligonucleotide PrimerRP 



<400> 134 

caggaaacag ctatgacc 



18 
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medicament. In preferred embodiments, the medicament is a drug acting against schizophrenia 
or bipolar disorder. 

Using the method of the present invention, the evaluation of drug efficacy may be 
conducted in a population of individuals likely to respond favorably to the medicament. 

Another aspect of the invention is a method of using a medicament comprising 
obtaining a DN A sample from a subject, determining whether the DNA sample contains alleles 
of one or more bialielic markers associated with a positive response to the medicament and/or 
whether the DNA sample contains alleles of one or more bialielic markers associated with a 
negative response to the medicament, and administering the medicament to the subject if the 
DNA sample contains alleles of one or more bialielic markers associated with a positive 
response to the medicament and/or if the DNA sample lacks alleles of one or more bialielic 
markers associated with a negative response to the medicament. 

The invention also concerns a method for the clinical testing of a medicament, 
preferably a medicament acting against schizophrenia or or bipolar disorder or symptoms 
thereof. The method comprises the following steps: 

- administering a medicament, preferably a medicament susceptible of acting against 
schizophrenia or or bipolar disorder or symptoms thereof to a heterogeneous population of 
individuals, 

- identifying a first population of individuals who respond positively to said 
medicament and a second population of individuals who respond negatively to said 
medicament, 

- identifying bialielic markers in said first population which are associated with a 
positive response to said medicament, 

- selecting individuals whose DNA comprises bialielic markers associated with a 
positive response to said medicament, and 

- administering said medicament to said individuals. 

In any of the methods for the prevention, diagnosis and treatment of schizophrenia and 
bipolar disorder, including methods of using a medicament, clinical testing of a medicament, 
determining whether a subject is likely to respond positively to treatment with a medicament, 
said bialielic marker may optionally comprise: 

(a) a bialielic marker selected from the group consisting of g35030-related markers A13 
to A65; (b) a bialielic marker selected from the group consisting of chromosome 13q31-q33- 
related-biallelic markers A 13 to A 1 8, A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to 
A73, A76 and A 123 ; or (c) a bialielic marker selected from the group consisting of 
chromosome 13q3 l-q33-related-biallelic markers A 13 to A 18, A20 to A47, A49 to A52, A55, 
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A57, A59 to A63. Furthermore, any of the embodiments of the invention may specifically 
exclude one or more of the biallelic markers selected from biallelic markers A 1 to A 127. 

Such methods are deemed to be extremely useful to increase the benefit/risk ratio 
resulting from the administration of medicaments which may cause undesirable side effects 
and/or be inefficacious to a portion of the patient population to which it is normally 
administered. 

Once an individual has been diagnosed as suffering from schizophrenia or bipolar 
disorder, selection tests are carried out to determine whether the DNA of this individual 
comprises alleles of a biallelic marker or of a group of biallelic markers associated with a 
positive response to treatment or with a negative response to treatment which may include either 
side effects or unresponsiveness. 

The selection of the patient to be treated using the method of the present invention can 
be carried out through the detection methods described above. The individuals which are to be 
selected are preferably those whose DNA does not comprise alleles of a biallelic marker or of a 
group of biallelic markers associated with a negative response to treatment. The knowledge of 
an individual's genetic predisposition to unresponsiveness or side effects to particular 
medicaments allows the clinician to direct treatment toward appropriate drugs against 
schizophrenia or bipolar disorder or symptoms thereof. 

Once the patient's genetic predispositions have been determined, the clinician can select 
appropriate treatment for which negative response, particularly side effects, has not been 
reported or has been reported only marginally for the patient. 

The biallelic markers of the invention have demonstrated an association with 
schizophrenia and bipolar disorders. However, the present invention also comprises any of the 
prevention, diagnostic, prognosis and treatment methods described herein using the biallelic 
markers of the invention in methods of preventing, diagnosing, managing and treating related 
disorders, particularly related CNS disorders. By way of example, related disorders may comprise 
psychotic disorders, mood disorders, autism, substance dependence and alcoholism, mental 
retardation, and other psychiatric diseases including cognitive, anxiety, eating, impulse-control, 
and personality disorders, as defined with the Diagnosis and Statistical Manual of Mental 
Disorders fourth edition (DSM-IV) classification". 
Recombinant Vectors 

The term "vector" is used herein to designate either a circular or a linear DNA or RNA 
molecule, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
multicellular host organism. 



WO 01/40493 

PCT/IB00/01507 

109 

The present invention encompasses a family of recombinant vectors that comprise a 
polynucleotide derived from a g35030 nucleic acid sequence. Consequently, the present 
invention further comprises recombinant vectors comprising: g35030 genomic DNA or cDNAs 
comprised in the nucleic acids of any of nucleotide positions 201 123 to 201234, 201 123 to 
20 1 560, 2 1 4676 to 2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 767 i 
to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 
241072 to 241291, 244353 to 244561, 246273 to 247802, 201 123 to 247802, 199122 to 
201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No. 1, SEQ ID Nos 2 to 17 and 
primate g35030 DNAs of SEQ ID Nos 79 to 132, and the complements thereof; 

Generally, a recombinant vector of the invention may comprise any of the 
polynucleotides described herein, as well as any g35030 primer or probe as defined above. 

In a first preferred embodiment, a recombinant vector of the invention is used to 
amplify the inserted polynucleotide derived from a g35030 genomic sequence or cDNA of the 
invention in a suitable cell host, this polynucleotide being amplified at every time that the 
recombinant vector replicates. 

A second preferred embodiment of the recombinant vectors according to the invention 
comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic 
acid of the invention, or both. Within certain embodiments, expression vectors are employed to 
express a g35030 polypeptide which can be then purified and, for example be used in ligand 
screening assays or as an immunogen in order to raise specific antibodies directed against a 
g35030 protein. In other embodiments, the expression vectors are used for constructing 
transgenic animals and also for gene therapy. Expression requires that appropriate signals are 
provided in the vectors, said signals including various regulatory elements, such as 
enhancers/promoters from both viral and mammalian sources that drive expression of the genes 
of interest in host cells. Dominant drug selection markers for establishing permanent, stable cell 
clones expressing the products are generally included in the expression vectors of the invention, 
as they are elements that link expression of the drug selection markers to expression of the 
polypeptide. 

More particularly, the present invention relates to expression vectors which include 
nucleic acids encoding a g35030 protein or variants or fragments thereof, under the control of a 
regulatory sequence of the respective g35030 regulatory polynucleotides, or alternatively under 
the control of an exogenous regulatory sequence. 

The invention also pertains to a recombinant expression vector useful for the expression 
of a g35030 cDNA sequence. 

Recombinant vectors comprising a nucleic acid containing a human chromosome 
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13q31-33-related biallelic marker, or more preferably a g35030-related biallelic marker is also 
part of the invention. In a preferred embodiment, said biallelic marker is selected from the 
group consisting of A 13 to A65 and the complements thereof. 

Some of the elements which can be found in the vectors of the present invention are 
described in further detail in the following sections. 

1. General features of the expression vectors of the invention 
A recombinant vector according to the invention comprises, but is not limited to, a 
YAC (Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a 
phagemid, a cosmid, a plasmid or even a linear DNA molecule which may comprise a 
chromosomal, non-chromosomal, semi-synthetic and synthetic DNA. Such a recombinant 
vector can comprise a transcriptional unit comprising an assembly of: 

(1) a genetic element or elements having a regulatory role in gene expression, for 
example promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from 
about 10 to 300 bp in length that act on the promoter to increase the transcription. 

(2) a structural or coding sequence which is transcribed into mRNA and eventually 
translated into a polypeptide, said structural or coding sequence being operably linked to the 
regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units 
intended for use in yeast or eukaryotic expression systems preferably include a leader sequence 
enabling extracellular secretion of translated protein by a host cell. Alternatively, when a 
recombinant protein is expressed without a leader or transport sequence, it may include a N- 
terminal residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

Generally, recombinant expression vectors will include origins of replication, selectable 
markers permitting transformation of the host cell, and a promoter derived from a highly 
expressed gene to direct transcription of a downstream structural sequence. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably a leader sequence capable of directing secretion of the translated 
protein into the periplasmic space or the extracellular medium. In a specific embodiment 
wherein the vector is adapted for transfecting and expressing desired sequences in mammalian 
host cells, preferred vectors will comprise an origin of replication in the desired host, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination sequences, and S'-flanking non- 
transcribed sequences. DNA sequences derived from the SV40 viral genome, for example 
SV40 origin, early promoter, enhancer, splice and polyadenylation sites may be used to provide 
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the required non-transcribed genetic elements. 

The in vivo expression of a g35030 polypeptide or fragments or variants thereof may be 
useful in order to correct a genetic defect related to the expression of the native gene in a host 
organism or to the production of a biologically inactive g35030 protein. 

Consequently, the present invention also comprises recombinant expression vectors 
mainly designed for the in vivo production of the g35030 polypeptide by the introduction of the 
appropriate genetic material in the organism of the patient to be treated. In preferred 
embodiments, said genetic material comprises at least one nucleotide sequence selected from 
the group of nucleotide posittion ranges consisting of: g35030 genomic DNA or cDNAs 
comprised in the nucleic acids of any of nucleotide positions 201 123 to 201234, 201 1 23 to 
20 1 560, 2 1 4676 to 2 14793, 2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 767 1 
to 217764, 227655 to 227736, 23871 5 to 238919, 240440 to 240673, 240440 to 241 153, 
241072 to 241291, 244353 to 244561, 246273 to 247802, 201 123 to 247802, 199122 to' 
201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No. 1, SEQ IDNos 2 to 17 and 
primate g35030 DNAs of SEQ ID Nos 79 to 132, and the complements thereof; 

This genetic material may be introduced in vitro in a cell that has been previously 
extracted from the organism, the modified cell being subsequently reintroduced in the said 
organism, directly in vivo into the appropriate tissue. 

2. Regulatory Elements 

Promoters 

The suitable promoter regions used in the expression vectors according to the present 
invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. The particular promoter employed to control the expression of a nucleic acid 
sequence of interest is not believed to be important, so long as it is capable of directing the 
expression of the nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is 
preferable to position the nucleic acid coding region adjacent to and under the control of a 
promoter that is capable of being expressed in a human cell, such as, for example, a human or a 
viral promoter. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
controls the expression or alternatively can be endogenous to the native polynucleotide 
containing the coding sequence to be expressed. Additionally, the promoter is generally 
heterologous with respect to the recombinant vector sequences within which the construct 
promoter/coding sequence has been inserted. 

Promoter regions can be selected from any desired gene using, for example, CAT 
(chloramphenicol transferase) vectors and more preferably P KK232-8 and P CM7 vectors. 
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Preferred bacterial promoters are the Lad, LacZ, the T3 or T7 bacteriophage RNA 
polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin 
promoter, or the plO protein promoter from baculovirus (Kit Novagen) (Smith et al., 1983; 
O'Reilly et al., 1992), the lambda PR promoter or also the trc promoter. 

Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and 
late SV40, LTRs from retrovirus, and mouse metallothionein-L. Selection of a convenient 
vector and promoter is well within the level of ordinary skill in the art. 

The choice of a promoter is well within the ability of a person skilled in the field of 
genetic engineering. For example, one may refer to the book of Sambrook et al.( 1 989) or also 
to the procedures described by Fuller et al.(1996). 
Other regulatory elements 

One will typically desire to include a polyadenylation signal to effect proper 
polyadenylation of the gene transcript. The nature of the polyadenylation signal is not believed 
to be crucial to the successful practice of the invention, and any such sequence may be 
employed such as human growth hormone and SV40 polyadenylation signals. Also 
contemplated as an element of the expression cassette is a terminator. These elements can serve 
to enhance message levels and to minimize read through from the cassette into other sequences. 

The vector containing the appropriate DNA sequence as described above, more 
preferably a g35030 gene regulatory polynucleotide, a polynucleotide encoding a g35030 
polypeptide comprising at least one nucleotide sequence selected from the group of nucleotide 
sequence ranges consisting of: g35030 genomic DNA or cDNAs comprised in the nucleic acids 
of any of nucleotide positions 201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 
to 215746, 216836 to 2 16994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 
238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 
244561, 246273 to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 
199122 to 249803 of SEQ ID No. 1, SEQ ID Nos 2 to 17 and primate g35030 DNAs of SEQ ID 
Nos 79 to 132, and the complements thereof. 

3. Selectable Markers 

Such markers would confer an identifiable change to the cell permitting easy 
identification of cells containing the expression construct. The selectable marker genes for 
selection of transformed host cells are preferably dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture, TRP1 fori", cerevisiae or tetracycline, rifampicin or 
ampicillin resistance in E. coli, or levan saccharase for mycobacteria, this latter marker being a 
negative selection marker. 

4. Preferred Vectors. 
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Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and a bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of pBR322 (ATCC 3701 7). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and 
GEMl (Promega Biotec, Madison, WI, USA). 

Large numbers of other suitable vectors are known to those of skill in the art, and 
commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 
(Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A, 
pNH!8A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 
(Pharmacia); pWLNEO, pSV2CAT, pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, 
pSVL (Pharmacia); pQE-30 (QIAexpress). 

Bacteriophage vectors 

The PI bacteriophage vector may contain large inserts ranging from about 80 to about 

100 kb. 

The construction of PI bacteriophage vectors such as pl58 or p!58/neo8 are notably 
described by Sternberg (l 992, 1994). Recombinant PI clones comprising g35030 
polynucleotide sequences may be designed for inserting large polynucleotides of more than 40 
kb (Linton et al., 1993). To generate PI DNA for transgenic experiments, a preferred protocol 
is the protocol described by McCormick et al.(1994). Briefly, E, coli (preferably strain 
NS3529) harboring the PI plasmid are grown overnight in a suitable broth medium containing 
25 ng/ml of kanamycin. The PI DNA is prepared from the E. coli by alkaline lysis using the 
Qiagen Plasmid Maxi kit (Qiagen, Chatsworth, CA, USA), according to the manufacturers 
instructions. The PI DNA is purified from the bacterial lysate on two Qiagen-tip 500 columns, 
using the washing and elution buffers contained in the kit. A phenol/chloroform extraction is 
then performed before precipitating the DNA with 70% ethanol. After solubilizing the DNA in 
TE (10 mM Tris-HCI, pH 7.4, 1 mM EDTA), the concentration of the DNA is assessed by 
spectrophotpmetry. 

When the goal is to express a PI clone comprising a g35030 polynucleotide sequence in 
a transgenic animal, typically in transgenic mice, it is desirable to remove vector sequences 
from the PI DNA fragment, for example by cleaving the PI DNA at rare-cutting sites within the 
P 1 poly I inker (5//1, Notl or Sail). The PI insert is then purified from vector sequences on a 
pulsed-field agarose gel, using methods similar using methods similar to those originally 
reported for the isolation of DNA from YACs (Schedl et al., 1993a; Peterson et al., 1993, ). At 
this stage, the resulting purified insert DNA can be concentrated, if necessary, on a Millipore 
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Ultrafree-MC Filter Unit (Millipore, Bedford, MA, USA - 30,000 molecular weight limit) and 
then dialyzed against microinjection buffer (10 mM Tris-HCI, pH 7.4; 250 \xM EDTA) 
containing 100 mM NaCI, 30 nM spermine, 70 \iM spermidine on a microdyaiisis membrane 
(type VS, 0.025 ^M from Millipore). The intactness of the purified PI DNA insert is assessed 
by electrophoresis on 1% agarose (Sea Kern GTG; FMC Bio-products) pulse-field gel and 
staining with ethidium bromide. 
Baculovirus vectors 

A suitable vector for the expression of a g35030 polypeptide encoded by a 
polynucleotide of SEQ ID Nos. 1 to 17 or fragments or variants thereof is a baculovirus vector 
that can be propagated in insect cells and in insect cell lines. A specific suitable host vector 
system is the pVLl 392/1 393 baculovirus transfer vector (Pharmingen) that is used to transfect 
the SF9 cell line (ATCC N°CRL 171 1) which is derived from Spodoptera frugiperda. 

Other suitable vectors for the expression of the g35030 polypeptide encoded by the 
polynucleotides of SEQ ID Nos. 1 to 17, or fragments or variants thereof in a baculovirus 
expression system include those described by Chai et al.(1993), Vlasak et al.(!983) and Lenhard 
et al.(1996). 

Viral vectors 

In one specific embodiment, the vector is derived from an adenovirus. Preferred 
adenovirus vectors according to the invention are those described by Feldman and Steg (1996) 
or Ohno et al.(1994). Another preferred recombinant adenovirus according to this specific 
embodiment of the present invention is the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an 
adenovirus of animal origin (French patent application N° FR-93. 05954). 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombinant gene delivery systems of choice . for the transfer of exogenous polynucleotides 
in vivo, particularly to mammals, including humans. These vectors provide efficient delivery of 
genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal 
DNA of the host. 

Particularly preferred retroviruses for the preparation or construction of retroviral in 
vitro or in vitro gene delivery vehicles of the present invention include retroviruses selected 
from the group consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, 
Reticuloendotheliosis virus and Rous Sarcoma virus. Particularly preferred Murine Leukemia 
Viruses include the 4070 A and the 1504A viruses, Abelson (ATCC No VR-999), Friend 
(ATCC No VR-245), Gross (ATCC No VR-590), Rauscher (ATCC No VR-998) and Moloney 
Murine Leukemia Virus (ATCC No VR-190; PCT Application No WO 94/24298). Particularly 
preferred Rous Sarcoma Viruses include Bryan high titer (ATCC Nos VR-334, VR-657, VR- 
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726, VR-659 and VR-728). Other preferred retroviral vectors are those described in Roth et 
al.(1996), PCT Application No WO 93/25234, PCT Application No WO 94/ 06920, Roux et al., 
1989,Julan et al., 1992 and Neda et al., 1991. 

Yet another viral vector system that is contemplated by the invention comprises the 
adeno-associated virus (AAV). The adeno-associated virus is a naturally occurring defective 
virus that requires another virus, such as an adenovirus or a herpes virus, as a helper vims for 
efficient replication and a productive life cycle (Muzyczka et al., 1992). It is also one of the 
few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of 
stable integration (Flotte et al., 1992; Samulski et al., 1989; McLaughlin et al., 1989). One 
advantageous feature of AAV derives from its reduced efficacy for transducing primary cells 
relative to transformed cells. 

BAC vectors 

The bacterial artificial chromosome (BAC) cloning system (Shizuya et al., 1992) has 
been developed to stably maintain large fragments of genomic DNA (100-300 kb) in E. colL A 
preferred BAC vector comprises pBeloBACl I vector that has been described by Kim et 
al.(1996). BAC libraries are prepared with this vector using size-selected genomic DNA that 
has been partially digested using enzymes that permit ligation into either the Bam HI or HinaHl 
sites in the vector. Flanking these cloning sites are T7 and SP6 RNA polymerase transcription 
initiation sites that can be used to generate end probes by either RNA transcription or PCR 
methods. After the construction of a BAC library in E. coli, BAC DNA is purified from the 
host cell as a supercoiled circle. Converting these circular molecules into a linear form precedes 
both size determination and introduction of the BACs into recipient cells. The cloning site is 
flanked by two Not I sites, permitting cloned segments to be excised from the vector by Not I 
digestion. Alternatively, the DNA insert contained in the pBeloBACl 1 vector may be 
linearized by treatment of the BAC vector with the commercially available enzyme lambda 
terminase that leads to the cleavage at the unique cosN site, but this cleavage method results in a 
full length BAC clone containing both the insert DNA and the BAC sequences. 
5. Delivery Of The Recombinant Vectors 

In order to effect expression of the polynucleotides and polynucleotide constructs of the 
invention, these constructs must be delivered into a cell. This delivery may be accomplished in 
vitro, as in laboratory procedures for transforming cell lines, or in vivo or ex vivo 9 as in the 
treatment of certain diseases states. 

One mechanism is viral infection where the expression construct is encapsulated in an 
infectious viral particle. 

Several non-viral methods for the transfer of polynucleotides into cultured mammalian 
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cells are also contemplated by the present invention, and include, without being limited to, 
calcium phosphate precipitation (Graham etal., 1973; Chenetal., 1987), DEAE-dextran 
(Gopal, 1 985), electroporation (Tur-Kaspa etal., 1986; Potter etal., 1984), direct 
microinjection (Harland et al., 1985), DNA-loaded liposomes (Nicolau et al, 1982; Fraley et 
al., 1979), and receptor-mediate transfection (Wu and Wu, 1987; 1988). Some of these 
techniques may be successfully adapted for in vivo or ex vivo use. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location 
and orientation via homologous recombination (gene replacement) or it may be integrated in a 
random, non specific location (gene augmentation). In yet further embodiments, the nucleic 
acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such 
nucleic acid segments or "episomes" encode sequences sufficient to permit maintenance and 
replication independent of or in synchronization with the host cell cycle. 

One specific embodiment for a method for delivering a protein or peptide to the interior 
of a cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 
physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the 
naked polynucleotide is taken up into the interior of the cell and has a physiological effect. This 
is particularly applicable for transfer in vitro but it may be applied to in vivo as well. 

Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 
described in PCT application N° WO 90/1 1092 (Vical Inc.) and also in PCT application No. 
WO 95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of 
Tacson et al.( 1 996) and of Huygen et al.( 1 996). 

In still another embodiment of the invention, the transfer of a naked polynucleotide of 
the invention, including a polynucleotide construct of the invention, into cells may be proceeded 
with a particle bombardment (biolistic), said particles being DNA-coated microprojectiles 
accelerated to a high velocity allowing them to pierce cell membranes and enter cells without 
killing them, such as described by Klein et al.(1987). 

In a further embodiment, the polynucleotide of the invention may be entrapped in a 
liposome (Ghosh and Bacchawat, 1991; Wong etal., 1980; Nicolau et al., 1987). 

In a specific embodiment, the invention provides a composition for the in vivo 
production of the g35030 protein or polypeptide described herein. It comprises a naked 
polynucleotide operatively coding for this polypeptide, in solution in a physiologically 
acceptable carrier, and suitable for introduction into a tissue to cause cells of the tissue to 
express the said protein or polypeptide. 
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The amount of vector to be injected to the desired host organism varies according to the 
site of injection. As an indicative dose, it will be injected between 0,1 and 100 ^g of the vector 
in an animal body, preferably a mammal body, for example a mouse body. 

In another embodiment of the vector according to the invention, it may be introduced in 
vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated 
and more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has 
been transformed with the vector coding for the desired g35030 polypeptide or the desired 
fragment thereof is reintroduced into the animal body in order to deliver the recombinant 
protein within the body either locally or systemically. 
Cell Hosts 

Another object of the invention comprises a host cell that have been transformed or 
transfected with one of the polynucleotides described herein, and more precisely a 
polynucleotide comprising a g35030 polynucleotide selected from the group consisting of SEQ 
ID Nos. I to 1 7 and 79 to 1 32, or a fragment or a variant thereof. Are included host cells that 
are transformed (prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant 
vector such as one of those described above. 

Generally, a recombinant host cell of the invention comprises any one of the 
polynucleotides or the recombinant vectors described therein. 

Preferred host cells used as recipients for the expression vectors of the invention are the 
following: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DH5-a strain), Bacillus subtilis, 
Salmonella typhimurium, and strains from species like Pseudomonas, Streptomyces and 
Staphylococcus. 

b) Eukaryotic host cells: HeLa cells (ATCC NXCL2; N°CCL2.1; N°CCL2.2), Cv 1 
cells (ATCC N°CCL70), COS cells (ATCC N°CRL1650; N°CRL1651), Sf-9 cells (ATCC 
N°CRL1 71 1), C127 cells (ATCC N° CRL-1804), 3T3 (ATCC N° CRL-6361), CHO (ATCC N° 
CCL-61), human kidney 293. (ATCC N° 45504; N° CRL-1573) and BHK (ECACC N° 
84100501; N° 841 11301). 

c) Other mammaiian host cells. 

G35030 gene expression in mammalian, and typically human, cells may be rendered 
defective with the replacement of a g35030 nucleic acid counterpart in the genome of an animal 
cell by a g35030 polynucleotide according to the invention. These genetic alterations may be 
generated by homologous recombination events using specific DNA constructs that have been 
previously described. 

One kind of cell hosts that may be used are mammal zygotes, such as murine zygotes. 
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For example, murine zygotes may undergo microinjection with a purified DNA molecule of 
interest, for example a purified DNA molecule that has previously been adjusted to a 
concentration range from 1 ng/ml -for BAC inserts- 3 ng/jil -for PI bacteriophage inserts- in 10 
mM Tris-HCI, pH 7.4, 250 \xM EDTA containing 100 mM NaCI, 30 jiM spermine, and70 nM 
spermidine. When the DNA to be microinjected has a large size, polyamines and high salt 
concentrations can be used in order to avoid mechanical breakage of this DNA, as described by 
Schedl et al (1993b). 

Any of the polynucleotides of the invention, including the DNA constructs described 
herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES cell line. 
ES cell lines are derived from pluripotent, uncommitted cells of the inner cell mass of pre- 
implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC n° 
CRL-182 1), ES-D3 (ATCC n° CRL1934 and n° CRL-1 1632), YS001 (ATCC n° CRL-1 1776), 
36.5 (ATCC n° CRL-1 1116). To maintain ES cells in an uncommitted state, they are cultured 
in the presence of growth inhibited feeder cells which provide the appropriate signals to 
preserve this embryonic phenotype and serve as a matrix for ES cell adherence. Preferred 
feeder cells are primary embryonic fibroblasts that are established from tissue of day 13- day 14 
embryos of virtually any mouse strain, that are maintained in culture, such as described by 
Abbondanzo et al.(1993) and are inhibited in growth by irradiation, such as described by 
Robertson (1987), or by the presence of an inhibitory concentration of LIF, such as described by 
Pease and Williams (1990). 

The constructs in the host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. 

Following transformation of a suitable host and growth of the host to an appropriate cell 
density, the selected promoter is induced by appropriate means, such as temperature shift or 
chemical induction, and cells are cultivated for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in the expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
cell lysing agents. Such methods are well known by the skill artisan. 
Transgenic Animals 

The terms "transgenic animals" or "host animals" are used herein designate animals that 
have their genome genetically and artificially manipulated so as to include one of the nucleic 
acids according to the invention. Preferred animals are non-human mammals and include those 
belonging to a genus selected from Mus (e.g. mice), Rattus (e.g. rats) and Oryctogalus (e.g. 
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rabbits) which have their genome artificially and genetically altered by the insertion of a 
nucleic acid according to the invention. In one embodiment, the invention encompasses non- 
human host mammals and animals comprising a recombinant vector of the invention or a 
g35030 gene disrupted by homologous recombination with a knock out vector. The invention 
also encompasses non-human primates comprising a recombinant vector of the invention or a 
g35030 gene disrupted by homologous recombination with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a 
cloned recombinant or synthetic DNA sequence, more specifically one of the purified or 
isolated nucleic acids comprising a g35030 polynucleotide or a DNA sequence encoding an 
antisense polynucleotide such as described in the present specification. 

Generally, a transgenic animal according the present invention comprises any one of the 
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. 

In a first preferred embodiment, these transgenic animals may be good experimental 
models in order to study the diverse pathologies related to cell differentiation, in particular 
concerning the transgenic animals within the genome of which has been inserted one or several 
copies of a polynucleotide encoding a native g35030 protein, or alternatively a mutant g35030 
protein. 

In a second preferred embodiment, these transgenic animals may express a desired 
polypeptide of interest under the control of regulatory polynucleotides which lead to good 
yields in the synthesis of this protein of interest, and optionally a tissue specific expression of 
this protein of interest. 

The design of the transgenic animals of the invention may be made according to the 
conventional techniques well known from the one skilled in the art. For more details regarding 
the production of transgenic animals, and specifically transgenic mice, it may be referred to US 
Patents Nos 4,873,191, issued Oct. 10, 1989; 5,464,764 issued Nov 7, 1995; and 5,789,215, 
issued Aug 4, 1998. 

Transgenic animals of the present invention are produced by the application of 
procedures which result in an animal with a genome that has incorporated exogenous genetic 
material. The procedure involves obtaining the genetic material, or a portion thereof, which 
encodes either a g35030 polynucleotide or antisense polynucleotide such as described in the 
present specification. 

A recombinant polynucleotide of the invention is inserted into an embryonic or ES stem 
cell line. The insertion is preferably made using electroporation, such as described by Thomas 
et al.(1987). The cells subjected to electroporation are screened (e.g. by selection via selectable 
markers, by PCR or by Southern blot analysis) to find positive cells which have integrated the 
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exogenous recombinant polynucleotide into their genome, preferably via an homologous 
recombination event. An illustrative positive-negative selection procedure that may be used 
according to the invention is described by Mansour et al.(1988). 

Then, the positive cells are isolated, cloned and injected into 3.5 days old blastocysts 
from mice, such as described by Bradley (1 987). The blastocysts are then inserted into a female 
host animal and allowed to grow to term. 

Alternatively, the positive ES cells are brought into contact with embryos at the 2.5 
days old 8-16 cell stage (morulae) such as described by Wood et al.( 1 993) or by Nagy et 
al.(l 993), the ES cells being internalized to colonize extensively the blastocyst including the 
cells which will give rise to the germ line. 

The offspring of the female host are tested to determine which animals are transgenic 
e.g. include the inserted exogenous DNA sequence and which are wild-type. 

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, 
a recombinant expression vector or a recombinant host cell according to the invention. 

Recombinant Cell Lines Derived From The Transgenic Animals Of The Invention. 
A further object of the invention comprises recombinant host cells obtained from a 
transgenic animal described herein. In one embodiment the invention encompasses cells 
derived from non-human host mammals and animals comprising a recombinant vector of the 
invention or a gene comprising a g35030 nucleic acid sequence disrupted by homologous 
recombination with a knock out vector. 

Recombinant cell lines may be established in vitro from cells obtained from any tissue 
of a transgenic animal according to the invention, for example by transfection of primary cell 
cultures with vectors expressing o«c-genes such as SV40 large T antigen, as described by Chou 
(1989)andShayetal.(l991). 

Assays For Identification Of Compounds For Treatment Of Schizophrenia And 

Bipolar Disorder 

The present invention provides assays which may be used to test compounds for their 
ability to treat CNS disorders, and in particular, to ameliorate symptoms of a CNS disorder 
mediated by g35030. In preferred embodiments, compounds tested for their ability to 
ameliorate syptoms of schizophrenia or bipolar disorder mediated by g35030. Compounds may 
also be tested for their ability to treat related disorders, including among others psychotic 
disorders, mood disorders, autism, substance dependence and alcoholism, mental retardation, and 
other psychiatric diseases including cognitive, anxiety, eating, impulse-control, and personality 
disorders, as defined with the Diagnosis and Statistical Manual of Mental Disorders fourth edition 
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(DSM-IV) classification. 

The present invention also provides cell and animal, including primate and mouse, models 
of schizophrenia, bipolar disorder and related disorders. 

In one aspect, provided are non-cell based, cell based and animal based assays for the 
identification of such compounds that affect g35030 activity. g35030 activity may be affected 
by any mechanism; in certain embodiments, g35030 activity is affected by modulating g35030 
gene expression or the activity of the g35030 gene product. 

The present methods allow the identification of compounds that affect g35030 activity 
directly or indirectly. Thus, the non-cell based, cell based and animal assays of the present 
invention may also be used to identify compounds that act on an element of a g35030 pathway 
other tha g35030 itself. These compounds can then be used as a therapeutic treatment to 
modulate g35030 and other gene products involved in schizophrenia, bipolar disorder and 
related disorders. 

Cell and non-cell based assays 

In one aspect, cell based assays using recombinant or non-recombinant cells may be 
used to identify compounds which modulate g35030 activity. 

In one aspect, a cell based assay of the invention encompasses a method for identifying 
a test compound for the treatment of schizophrenia or bipolar disorder comprising (a) exposing 
a cell to a test compound at a concentration and time sufficient to ameliorate an endpoint related 
to schizophrenia or bipolar disorder, and (b) determining the level of g35030 activity in a cell. 
g35030 activity can be measured, for example, by assaying a cell for mRNA transcript level, 
g35030 peptide expression, localization or protein activity. Preferably the test compound is a 
compound capable of or suspected to be capable of ameliorating a symptom of schizophrenia, 
bipolar disorder or a related disorder. Test compounds capable of modulating g35030 activity 
may be selected for use in developing medicaments. Such cell based assays are further 
described herein in the section titled "Method For Screening Ligands That Modulate The 
Expression Of The g35030 Gene." 

In another aspect, a cell based assay of the invention encompasses a method for 
identifying a compound for the treatment of schizophrenia or bipolar disorder comprising (a) 
exposing a cell to a level of g35030 activity sufficient to cause a schizophrenia-related or 
bipolar disorder-related endpoint, and (b) exposing said cell to a test compound. A test 
compound can then be selected according to its ability to ameliorate said schizophrenia-related 
or bipolar disorder-related endpoints. g35030 activity may be provided by any suitable method, 
including but not limited to providing a vector containing a g35030 nucleotide sequence, 
treating said cell with a compound capable of increasing g35030 expression, and treating said 
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cell with a g35030 peptide. Preferably said cell is treated with a g35030 peptide comprising a 
contiguous span of at least 4 amino acids of SEQ ID Nos. 1 8 to 23. Preferably the test 
compound is a compound capable of or suspected to be capable of ameliorating a symptom of 
schizophrenia, bipolar disorder or a related disorder; alternatively, the test compound is 
suspected of exacerbating an endpoint schizophrenia, bipolar disorder or a related disorder. A 
test compound capable of ameliorating any detectable symptom or endpoint of a schizophrenia, 
bipolar disorder or a related disorder may be selected for use in developing medicaments. 

In another embodiment, the invention provides cell and non-cell based assays to g35030 
to determine whether g35030 peptides bind to the cell surface, and to identify compounds for 
the treatment of schizophrenia, bipolar disorder and related disorders that interact with a g35030 
receptor. In one such embodiment, a g35030 polynucleotide, or fragments thereof, is cloned 
into expression vectors such as those described herein. The proteins are purified by size, 
charge, immunochromatography or other techniques familiar to those skilled in the art. 
Following purification, the proteins are labeled using techniques known to those skilled in the 
art. The labeled proteins are incubated with cells or cell lines derived from a variety of organs 
or tissues to allow the proteins to bind to any receptor present on the cell surface. Following the 
incubation, the cells are washed to remove non-specifically bound protein. The labeled proteins 
are detected by autoradiography. Alternatively, unlabeled proteins may be incubated with the 
cells and detected with antibodies having a detectable label, such as a fluorescent molecule, 
attached thereto. Specificity of ceil surface binding may be analyzed by conducting a 
competition analysis in which various amounts of unlabeled protein are incubated along with 
the labeled protein. The amount of labeled protein bound to the cell surface decreases as the 
amount of competitive unlabeled protein increases. As a control, various amounts of an 
unlabeled protein unrelated to the labeled protein is included in some binding reactions. The 
amount of labeled protein bound to the cell surface does not decrease in binding reactions 
containing increasing amounts of unrelated unlabeled protein, indicating that the protein 
encoded by the nucleic acid binds specifically to the cell surface. 

In another embodiment, the present invention comprises non-cell based binding assays, 
wherein a g35030 polypeptide is prepared and purified as in cell based binding assays described 
above. Following purification, the proteins are labeled and incubated with a cell membrane 
extract or isolate derived from any desired cells from any organs, tissue or combination of 
organs or tissues of interest to allow the g3503O polypeptide to bind to any receptor present on a 
membrane. Following the incubation, the membranes are washed to remove non-specifically 
bound protein. The labeled proteins may be detected by autoradiography. Specificity of 
membrane binding of g3S03O may be analyzed by conducting a competition analysis in which 
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various amounts of a test compound are incubated along with the labeled protein. Any desired 
test compound, including test polypeptides, can be incubated with the cells. The test 
compounds may be detected with antibodies having a detectable label, such as a fluorescent 
molecule, attached thereto. The amount of labeled g35030 polypeptide bound to the cell surface 
decreases as the amount of competitive test compound increases. As a control, various amounts 
of an unlabeled protein or a compound unrelated to the test compound is included in some 
binding reactions. Test compounds capable of reducing the amount of g35030 bound to cell 
membranes may be selected as a candidate therapeutic compound. 

Said cell based assays may comprise cells of any suitable origin; particularly preferred 
cells are human cells, primate cells, non-human primate cells and mouse cells. If non-human 
primate cells are used, the g35030 may comprise a nucleotide sequence or be encoded by a 
nucleotide sequence according to the primate nucleic acid sequences of SEQ ID No. 79 to 132, 
or a sequence complementary thereto or a fragment thereof. 

Animal model based assay 

Non-human animal based assays may also be used to identify compounds which 
modulate g35030 activity. The invention encompasses animal models and animal based assays, 
including non-transgenic or transgenic animals, including animals containing a human or altered 
form of the g35030 gene. 

Thus, the present invention comprises treating an animal affected by schizophrenia or 
bipolar disorder or symptoms thereof with a test compound capable of directly or indirectly 
modulating g35030 activity. 

In one aspect, an animal based assay of the invention encompasses a method for 
identifying a test compound for the treatment of schizophrenia or bipolar disorder comprising 
(a) exposing an animal to a test compound at a concentration and time sufficient to ameliorate 
an endpoint related to schizophrenia or bipolar disorder, and (b) determining the level of 
g35030 activity at a site in said animal. g35030 activity can be measured in any suitable cell, 
tissue or site. Preferably the test compound is a compound capable of or suspected to be 
capable of ameliorating a symptom of schizophrenia, bipolar disorder or a related disorder. 
Optionally said test compound is capable or suspected to be capable of modulating g35030 
activity. Test compounds capable of modulating g35030 activity may be selected for use in 
developing medicaments. 

In another aspect, an animal based assay of the invention encompasses a method for 
identifying a compound for the treatment of schizophrenia or bipolar disorder comprising (a) 
exposing an animal to a level of g35030 activity sufficient to cause a schizophrenia-related or 
bipolar disorder-related symptom or endpoint, and (b) exposing said animal to a test compound. 
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A test compound can then be selected according to its ability to ameliorate said schizophrenia- 
related or bipolar disorder-related endpoints. g35030 activity may be provided by any suitable 
method, including but not limited to providing a vector containing a g35030 nucleotide 
sequence, treating said animal with a compound capable of increasing g35030 expression and 
treating said cell with a g35030 peptide. Preferably, said animal is treated with a g35030 
peptide comprising a contiguous span of at least 4 amino acids of SEQ TD Nos. 1 8 to 23. 
Preferably the test compound is a compound capable of or suspected to be capable of 
ameliorating a symptom of schizophrenia, bipolar disorder or a related disorder; alternatively, 
the test compound is suspected of exacerbating a symptom of schizophrenia, bipolar disorder or 
a related disorder. A test compound capable of ameliorating any detectable symptom or 
endpoint of a schizophrenia, bipolar disorder or a related disorder may be selected for use in 
developing medicaments. 

Any suitable animal may be used. Preferably, said animal is a primate, a non-human 
primate, a mammal, or a mouse. 

In one embodiment, a mouse is treated with a g35030 peptide, exposed to a test 
compound, and symptoms indicative of schizophrenia, bipolar disorder or a related disorder are 
assessed by observing stereotypy. In other embodiments, said symptoms are assessed by 
performing at least one test from the group consisting of home cage observation, neurological 
evaluation, stress-induced hypothermia, forced swim, PTZ seizure, locomotor activity, tail 
suspension, elevated plus maze, novel object recognition, prepulse inhibition, thermal pain, Y- 
maze, and metabolic chamber tests (Psychoscreen™ tests available from Psychogenics Inc.). 
Other tests are known in Crawley etal, Horm. Behav. 3 1(3): 197-2 1 1 (1997); Crawley, Brain 
Res 83 5( 1 ): 1 8-26 ( 1 999) for example. 

Any suitable test compound may be used with the screening methods of the invention. 
Examples of compounds that may be screened by the methods of the present invention include 
small organic or inorganic molecules, nucleic acids, including polynucleotides from random and 
directed polynucleotide libraries, peptides, including peptides derived from random and directed 
peptide libraries, soluble peptides, fusion peptides, and phosphopeptides, antibodies including 
polyclonal, monoclonal, chimeric, humanized, and ant i- idiotypic antibodies, and single chain 
antibodies, FAb, F(ab')2 and FAb expression library fragments, and epitope-binding fragments 
thereof. In certain aspects, a compound capable of ameliorating or exacerbating a symptom or 
endpoint of schizophrenia, bipolar disorder or a related disorder may include, by way of 
example, antipsychotic drugs in general, neuroleptics, atypical neuroleptics, antidepressants, 
anti-anxiety drugs, noradrenergic agonists and antagonists, dopaminergic agonists and 
antagonists, serotonin reuptake inhibitors, benzodiazepines. 



WO 01/40493 PCT/IB00/01507 

125 

Methods for screening substances interacting with a g35030 polypeptides 

For the purpose of the present invention, a ligand means a molecule, such as a protein, a 
peptide, an antibody or any synthetic chemical compound capable of binding to the g35030 
protein or one of its fragments or variants or to modulate the expression of the polynucleotide 
coding for the g35030 or a fragment or variant thereof. 

In the ligand screening method according to the present invention, a biological sample 
or a defined molecule to be tested as a putative ligand of the g35030 protein is brought into 
contact with the corresponding purified g35030 protein, for example the corresponding purified 
recombinant g35030 protein produced by a recombinant cell host as described hereinbefore, in 
order to form a complex between this protein and the putative ligand molecule to be tested. 

As an illustrative example, to study the interaction of the g35030 protein, or a fragment 
comprising a contiguous span of at least 4 amino acids, preferably at least 6, or preferably at 
least 8 to 1 0 amino acids, more preferably at least 12, 1 5, 20, 25, 30, 40, 50, or 100 amino acids 
of SEQ ID Nos 18 to 23, with drugs or small molecules, such as molecules generated through 
combinatorial chemistry approaches, the microdialysis coupled to HPLC method described by 
Wang et al. (1997) or the affinity capillary electrophoresis method described by Bush et al. 
(1997), can be used. 

In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which 
interact with the g35030 protein, or a fragment comprising a contiguous span of at least 4 amino 
acids, preferably at least 6, or preferably at least 8 to 10 amino acids, more preferably at least 
1 2, 1 5, 20, 25, 30, 40, 50, or 1 00 amino acids of SEQ ID Nos 1 8 to 23, may be identified using 
assays such as the following. The molecule to be tested for binding is labeled with a detectable 
label, such as a fluorescent , radioactive, or enzymatic tag and placed in contact with 
immobilized g35030 protein, or a fragment thereof under conditions which permit specific 
binding to occur. After removal of non-specifically bound molecules, bound molecules are 
detected using appropriate means. 

Another object of the present invention comprises methods and kits for the screening of 
candidate substances that interact with a g35030 polypeptide. 

The present invention pertains to methods for screening substances of interest that 
interact with a g35030 protein or one fragment or variant thereof. By their capacity to bind 
covalently or non-covalently to a g35030 protein or to a fragment or variant thereof, these 
substances or molecules may be advantageously used both in vitro and in vivo. 

In vitro, said interacting molecules may be used as detection means in order to identify 
the presence of a g35030 protein in a sample, preferably a biological sample. 

A method for the screening of a candidate substance comprises the following steps : 
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a) providing a polypeptide comprising, consisting essentially of, or consisting of a 
g35030 protein or a fragment comprising a contiguous span of at least 4 amino acids, preferably 
at least 6 amino acids, more preferably at least 8 to 10 amino acids, more preferably at least 12, 
15, 20, 25, 30, 40, 50, or 100 amino acids of SEQ ID Nos. 18 to 23; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; and 

d) detecting the complexes formed between said polypeptide and said candidate 
substance. 

The invention further concerns a kit for the screening of a candidate substance 
interacting with the g35030 polypeptide, wherein said kit comprises: 

a) a g35030 protein having an amino acid sequence selected from the group consisting 
of the amino acid sequences of SEQ ID Nos. 1 8 to 23 or a peptide fragment comprising a 
contiguous span of at least 4 amino acids, preferably at least 6 amino acids, more preferably at 
least 8 to 10 amino acids, and more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino 
acids of SEQ ID Nos. 1 8 to 23; and 

b) optionally means useful to detect the complex formed between the g35030 protein or 
a peptide fragment or a variant thereof and the candidate substance. 

In a preferred embodiment of the kit described above, the detection means comprise 
monoclonal or polyclonal antibodies directed against the g35030 protein or a peptide fragment 
or a variant thereof. 

Various candidate substances or molecules can be assayed for interaction with a g35030 
polypeptide. These substances or molecules include, without being limited to, natural or 
synthetic organic compounds or molecules of biological origin such as polypeptides. When the 
candidate substance or molecule comprise a polypeptide, this polypeptide may be the resulting 
expression product of a phage clone belonging to a phage-based random peptide library, or 
alternatively the polypeptide may be the resulting expression product of a cDNA library cloned 
in a vector suitable for performing a two-hybrid screening assay. 

The invention also pertains to kits useful for performing the hereinbefore described 
screening method. Preferably, such kits comprise a g35030 polypeptide or a fragment or a 
variant thereof, and optionally means useful to detect the complex formed between the g35030 
polypeptide or its fragment or variant and the candidate substance. In a preferred embodiment 
the detection means comprise monoclonal or polyclonal antibodies directed against the 
corresponding §35030 polypeptide or a fragment or a variant thereof. 
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A. Candidate ligands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the 
expression product of a DNA insert contained in a phage vector (Parmley and Smith, 1988). 
Specifically, random peptide phages libraries are used. The random DNA inserts encode for 
5 peptides of 8 to 20 amino acids in length (Oldenburg K.R. et al., 1 992; Valadon P., et al., 1996; 

Lucas A.H., 1994; Westerink M.A.J., 1995; Felici F. et al., 1991). According to this particular 
embodiment, the recombinant phages expressing a protein that binds to the immobilized g3503O 
protein is retained and the complex formed between the g35030 protein and the recombinant 
phage may be subsequently immunoprecipitated by a polyclonal or a monoclonal antibody 

10 directed against the g35030 protein. 

Once the ligand library in recombinant phages has been constructed, the phage 
population is brought into contact with the immobilized g35030 protein. Then the preparation 
of complexes is washed in order to remove the non-specifically bound recombinant phages. 
The phages that bind specifically to the g35030 protein are then eluted by a buffer (acid pH) or 

1 5 immunoprecipitated by the monoclonal antibody produced by the hybridoma anti- g35030, and 

this phage population is subsequently amplified by an over-infection of bacteria (for example E. 
coli). The selection step may be repeated several times, preferably 2-4 times, in order to select 
the more specific recombinant phage clones. The last step comprises characterizing the peptide 
produced by the selected recombinant phage clones either by expression in infected bacteria and 

20 isolation, expressing the phage insert in another host-vector system, or sequencing the insert 

contained in the selected recombinant phages. 

B. Candidate ligands obtained by competition experiments. 

Alternatively, peptides, drugs or small molecules which bind to the g35030 protein, or a 
fragment comprising a contiguous span of at least 4 amino acids, preferably at least 6 amino 

25 acids, more preferably at least 8 to 10 amino acids, and more preferably at least 12, 15, 20, 25, 

30, 40, 50, or 100 amino acids of SEQ ID Nos. 18 to 23, may be identified in competition 
experiments. In such assays, the g35030 protein, or a fragment thereof, is immobilized to a 
surface, such as a plastic plate. Increasing amounts of the peptides, drugs or small molecules 
are placed in contact with the immobilized g35030 protein, or a fragment thereof, in the 

30 presence of a detectable labeled known g35030 protein ligand. For example, the g35030 ligand 

may be detectably labeled with a fluorescent, radioactive, or enzymatic tag. The ability of the 
test molecule to bind the g35030 protein, or a fragment thereof, is determined by measuring the 
amount of detectably labeled known ligand bound in the presence of the test molecule. A 
decrease in the amount of known ligand bound to the g35030 protein, or a fragment thereof, 
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when the test molecule is present indicated that the test molecule is able to bind to the g35030 
protein, or a fragment thereof. 

C. Candidate ligands obtained by affinity chromatography. 

Proteins or other molecules interacting with the g35030 protein, or a fragment 
comprising a contiguous span of at 4 amino acids, preferably at least 6 amino acids, more 
preferably at least 8 to 10 amino acids, and more preferably at least 12, 15, 20, 25, 30, 40, 50, or 
1 00 amino acids of SEQ ID Nos 1 8 to 23, can also be found using affinity columns which 
contain the g35030 protein, or a fragment thereof. The g35030 protein, or a fragment thereof, 
may be attached to the column using conventional techniques including chemical coupling to a 
suitable column matrix such as agarose, Aflfi Gel® , or other matrices familiar to those of skill 
in art. In some embodiments of this method, the affinity column contains chimeric proteins in 
which the g35030 protein, or a fragment thereof, is fused to glutathion S transferase (GST). A 
mixture of cellular proteins or pool of expressed proteins as described above is applied to the 
affinity column. Proteins or other molecules interacting with the g35030 protein, or a fragment 
thereof, attached to the column can then be isolated and analyzed on 2-D electrophoresis gel as 
described in Ramunsen et al. (1997). Alternatively, the proteins retained on the affinity column 
can be purified by electrophoresis based methods and sequenced. The same method can be used 
to isolate antibodies, to screen phage display products, or to screen phage display human 
antibodies. 

D. Candidate ligands obtained by optical biosensor methods 

Proteins interacting with the g35030 protein, or a fragment comprising a contiguous 
span of at least 4 amino acids, preferably at least 6 amino acids, more preferably at least 8 to 10 
amino acids, and more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of SEQ 
ID Nos. 1 8 to 23, can also be screened by using an Optical Biosensor as described in Edwards 
and Leatherbarrow (1997) and also in Szabo et al. (1995). This technique permits the detection 
of interactions between molecules in real time, without the need of labeled molecules. This 
technique is based on the surface plasmon resonance (SPR) phenomenon. Briefly, the candidate 
ligand molecule to be tested is attached to a surface (such as a carboxymethyl dextran matrix). 
A light beam is directed towards the side of the surface that does not contain the sample to be 
tested and is reflected by said surface. The SPR phenomenon causes a decrease in the intensity 
of the reflected light with a specific association of angle and wavelength. The binding of 
candidate ligand molecules cause a change in the refraction index on the surface, which change 
is detected as a change in the SPR signal. For screening of candidate ligand molecules or 
substances that are able to interact with the g35030 protein, or a fragment thereof, the g35030 
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protein, or a fragment thereof, is immobilized onto a surface. This surface comprises one side 
of a cell through which flows the candidate molecule to be assayed. The binding of the 
candidate molecule on the g35030 protein, or a fragment thereof, is detected as a change of the 
SPR signal. The candidate molecules tested may be proteins, peptides, carbohydrates, lipids, or 
small molecules generated by combinatorial chemistry. This technique may also be performed 
by immobilizing eukaryotic or prokaryotic cells or lipid vesicles exhibiting an endogenous or a 
recombinantly expressed g35030 protein at their surface. 

The main advantage of the method is that it allows the determination of the association 
rate between the g35030 protein and molecules interacting with the g35030 protein. It is thus 
possible to select specifically ligand molecules interacting with the g35030 protein, or a 
fragment thereof, through strong or conversely weak association constants. 

E. Candidate ligands obtained through a two-hybrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo 
(Fields and Song, 1989), and relies upon the fusion of a bait protein to the DNA binding domain 
of the yeast Gal4 protein. This technique is also described in the US Patent N° US 5,667,973 
and the US Patent N° 5,283,173 (Fields et al.). 

The general procedure of library screening by the two-hybrid assay may be performed 
as described by Harper et al. ( 1 993) or as described by Cho et al. (1 998) or also Fromont-Racine 
etal. (1997). 

The bait protein or polypeptide comprises, consists essentially of, or consists of a 
g35030 polypeptide or a fragment comprising a contiguous span of at least 4 amino acids, 
preferably at least 6 amino acids, more preferably at least 8 to 10 amino acids, and more 
preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of SEQ ID Nos. 18to23. 

More precisely, the nucleotide sequence encoding the g35030 polypeptide or a fragment 
or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 
protein, the fused nucleotide sequence being inserted in a suitable expression vector, for 
example pAS2 or pM3. 

Then, a human cDNA library is constructed in a specially designed vector, such that the 
human cDNA insert is fused to a nucleotide sequence in the vector that encodes the 
transcriptional domain of the GAL4 protein. Preferably, the vector used is the pACT vector. 
The polypeptides encoded by the nucleotide inserts of the human cDNA library are termed 
"pray" polypeptides. 

A third vector contains a detectable marker gene, such as beta galactosidase gene or 
CAT gene that is placed under the control of a regulation sequence that is responsive to the 
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binding of a complete Gal4 protein containing both the transcriptional activation domain and 
the DNA binding domain. For example, the vector pG5EC may be used. 

Two different yeast strains are also used. As an illustrative but non limiting example 
the two different yeast strains may be the followings : 

- Y190, the phenotype of which is (MATa, Leu2-3. 112 ura3-12, trpl-901, his3-D200, ade2- 
101, gal4Dgal!80D URA3 GAL-LacZ, LYS GAL-HIS3, cytf)\ 

- Y 1 87, the phenotype of which is (AH 7b gal4 gal80 his3 trpl-901 ade2-101 ura3-52 leu2-3 t 
-112 URA3 GAL-lacZmef\ which is the opposite mating type of Y190. 

Briefly, 20 ug of pAS2/ g35030 and 20 jig of pACT-cDNA library are co-transformed 
into yeast strain Y190. The transformants are selected for growth on minimal media lacking 
histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 
mM). Positive colonies are screened for beta galactosidase by filter lift assay. The double 
positive colonies (His*, beta-gat) are then grown on plates lacking histidine, leucine, but 
containing tryptophan and cycloheximide (10 mg/ml) to select for loss of pAS2/ g35030 
plasmids by retention of pACT-cDNA library plasmids. The resulting Y190 strains are mated 
with Y187 strains expressing g35030 or non-related control proteins; such as cyclophilin B, 
lamin, or SNF 1 , as Gal4 fusions as described by Harper et al. ( 1 993) and by Bram et al. (Bram 
RJ et al., 1993), and screened for beta galactosidase by filter lift assay. Yeast clones that are 
beta gal- after mating with the control Gai4 fusions are considered false positives. 

In another embodiment of the two-hybrid method according to the invention, interaction 
between the g35030 or a fragment or variant thereof with cellular proteins may be assessed 
using the Matchmaker Two Hybrid System 2 (Catalog No. K 1604-1, Clontech). As described in 
the manual accompanying the Matchmaker Two Hybrid System 2 (Catalog No. K1604-1, 
Clontech), nucleic acids encoding the g35030 protein or a portion thereof, are inserted into an 
expression vector such that they are in frame with DNA encoding the DNA binding domain of the 
yeast transcriptional activator GAL4. A desired cDNA, preferably human cDNA, is inserted into a 
second expression vector such that they are in frame with DNA encoding the activation domain of 
GAL4. The two expression plasmids are transformed into yeast and the yeast are plated on 
selection medium which selects for expression of selectable markers on each of the expression 
vectors as well as GAL4 dependent expression of the HIS3 gene. Transformants capable of 
growing on medium lacking histidine are screened for GAL4 dependent lacZ expression. Those 
cells which are positive in both the histidine selection and the lacZ assay contain interaction 
between g35030 and the protein or peptide encoded by the initially selected cDNA insert 
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Method For Screening Substances Interacting With The Regulatory Sequences Of 
A g35030 Gene. 

The present invention also concerns a method for screening substances or molecules 
that are able to interact with the regulatory sequences of the g35030 gene, such as for example 
promoter or enhancer sequences. 

Nucleic acids encoding proteins which are able to interact with the regulatory sequences 
of the g35030 gene, more particularly a nucleotide sequence selected from the group consisting 
of the polynucleotides of the 5' and 3' regulatory region or a fragment or variant thereof, and 
preferably a variant comprising one of the biallelic markers of the invention, may be identified 
by using a one-hybrid system, such as that described in the booklet enclosed in the Matchmaker 
One-Hybrid System kit from Clontech (Catalog Ref. n° K 1 603-1). Briefly, the target nucleotide 
sequence is cloned upstream of a selectable reporter sequence and the resulting DNA construct 
is integrated in the yeast genome (Saccharomyces cerevisiae). The yeast cells containing the 
reporter sequence in their genome are then transformed with a library comprising fusion 
molecules between cDNAs encoding candidate proteins for binding onto the regulatory 
sequences of the g35030 gene and sequences encoding the activator domain of a yeast 
transcription factor such as GAL4. The recombinant yeast cells are plated in a culture broth for 
selecting cells expressing the reporter sequence. The recombinant yeast cells thus selected 
contain a fusion protein that is able to bind onto the target regulatory sequence of the g35030 
gene. Then, the cDNAs encoding the fusion proteins are sequenced and may be cloned into 
expression or transcription vectors in vitro. The binding of the encoded polypeptides to the 
target regulatory sequences of the g3503O gene may be confirmed by techniques familiar to the 
one skilled in the art, such as gel retardation assays or DNAse protection assays. 

Gel retardation assays may also be performed independently in order to screen 
candidate molecules that are able to interact with the regulatory sequences of the g35030 gene, 
such as described by Fried and Crothers (1981), Garner and Revzin (1981) and Dent and 
Latchman (1993). These techniques are based on the principle according to which a DNA 
fragment which is bound to a protein migrates slower than the same unbound DNA fragment. 
Briefly, the target nucleotide sequence is labeled. Then the labeled target nucleotide sequence 
is brought into contact with either a total nuclear extract from cells containing transcription 
factors, or with different candidate molecules to be tested. The interaction between the target 
regulatory sequence of the g35030 gene and the candidate molecule or the transcription factor is 
detected after gel or capillary electrophoresis through a retardation in the migration. 



Method For Screening Ligands That Modulate The Expression Of The g35030 
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Gene 

Another subject of the present invention is a method for screening molecules that 
modulate the expression of the g35030 protein. Such a screening method comprises the steps 
of: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a 
nucleotide sequence encoding the g35030 protein or a variant or a fragment thereof, placed 
under the control of its own promoter; 

b) bringing into contact the cultivated cell with a molecule to be tested; 

c) quantifying the expression of the g35030 protein or a variant or a fragment thereof. 
In an embodiment, the nucleotide sequence encoding the g35030 protein or a variant or 

a fragment thereof comprises an allele of at least one g35030 related biallelic marker. 

Using DNA recombination techniques well known by the one skill in the art, the 
g35030 protein encoding DNA sequence is inserted into an expression vector, downstream from 
its promoter sequence. As an illustrative example, the promoter sequence of the g35030 gene is 
contained in the nucleic acid of the 5' regulatory region. 

The quantification of the expression of the g35O30 protein may be realized either at the 
mRNA level or at the protein level. In the latter case, polyclonal or monoclonal antibodies may 
be used to quantify the amounts of the g35030 protein that have been produced, for example in 
an ELIS A or a RIA assay. 

In a preferred embodiment, the quantification of the g35030 mRNA is realized by a 
quantitative PCR amplification of the cDNA obtained by a reverse transcription of the total 
mRNA of the cultivated g35030 -transfected host cell, using a pair of primers specific for 
g35030. 

The present invention also concerns a method for screening substances or molecules 
that are able to increase, or in contrast to decrease, the level of expression of the g35030 gene. 
Such a method may allow the one skilled in the art to select substances exerting a regulating 
effect on the expression level of the g35030 gene and which may be useful as active ingredients 
included in pharmaceutical compositions for treating patients suffering from diseases. 

Thus, is also part of the present invention a method for screening of a candidate 
substance or molecule that modulated the expression of the g35030 gene, this method comprises 
the following steps: 

- providing a recombinant cell host containing a nucleic acid, wherein said nucleic acid 
comprises a nucleotide sequence of the 5* regulatory region or a biologically active fragment or 
variant thereof located upstream a polynucleotide encoding a detectable protein; 

- obtaining a candidate substance; and 
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- determining the ability of the candidate substance to modulate the expression levels of 
the polynucleotide encoding the detectable protein. 

In a further embodiment, the nucleic acid comprising the nucleotide sequence of the 5* 
regulatory region or a biologically active fragment or variant thereof also includes a 5'UTR 
region of the g35030 cDNA of SEQ ID No 2 to 17, or one of its biologically active fragments or 
variants thereof. 

Among the preferred polynucleotides encoding a detectable protein, there may be cited 
polynucleotides encoding beta galactosidase, green fluorescent protein (GFP) and 
chloramphenicol acetyl transferase (CAT). 

The invention also pertains to kits useful for performing the herein described screening 
method. Preferably, such kits comprise a recombinant vector that allows the expression of a 
nucleotide sequence of the 5' regulatory region or a biologically active fragment or variant 
thereof located upstream and operably linked to a polynucleotide encoding a detectable protein 
or the g35030 protein or a fragment or a variant thereof. 

In another embodiment of a method for the screening of a candidate substance or 
molecule that modulates the expression of the g35030 gene, wherein said method comprises the 
following steps: 

a) providing a recombinant host cell containing a nucleic acid, wherein said nucleic acid 
comprises a 5'UTR sequence of a g35030 cDNA, preferably of a g35030, or one of its 
biologically active fragments or variants, the 5'UTR sequence or its biologically active 
fragment or variant being operably linked to a polynucleotide encoding a detectable protein; 

b) obtaining a candidate substance; and 

c) determining the ability of the candidate substance to modulate the expression levels 
of the polynucleotide encoding the detectable protein. 

In a specific embodiment of the above screening method, the nucleic acid that 
comprises a nucleotide sequence selected from the group consisting of the 5'UTR sequence of a 
g35030 cDNA, preferably of a g35030 or one of its biologically active fragments or variants, 
includes a promoter sequence which is endogenous with respect to the g35030 5'UTR sequence. 

In another specific embodiment of the above screening method, the nucleic acid that 
comprises a nucleotide sequence selected from the group consisting of the 5'UTR sequence of a 
g35030 cDNA or one of its biologically active fragments or variants, includes a promoter 
sequence which is exogenous with respect to the g35030 5'UTR sequence defined therein. 

In a further preferred embodiment, the nucleic acid comprising the 5'-UTR sequence of 
a g35030 cDNA or the biologically active fragments thereof includes a g35030-related biallelic 
marker. 



WO 01/40493 PCT/TB00/01507 

134 

The invention further comprises a kit for the screening of a candidate substance 
modulating the expression of the g35030 gene, wherein said kit comprises a recombinant vector 
that comprises a nucleic acid including a 5'UTR sequence of the g35030 cDNA of SEQ ID Nos 
2 to 17, or one of their biologically active fragments or variants, the 5'UTR sequence or its 
biologically active fragment or variant being operably linked to a polynucleotide encoding a 
detectable protein. 

For the design of suitable recombinant vectors useful for performing the screening 
methods described above, it will be referred to the section of the present specification wherein 
the preferred recombinant vectors of the invention are detailed. 

Expression levels and patterns of g35030 may be analyzed by solution hybridization 
with long probes as described in International Patent Application No. WO 97/05277. Briefly, 
the g35030 cDNA or the g35030 genomic DNA described above, or fragments thereof, is 
inserted at a cloning site immediately downstream of a bacteriophage (T3, T7 or SP6) RNA 
polymerase promoter to produce antisense RNA. Preferably, the g35030 insert comprises at 
least 100 or more consecutive nucleotides of the genomic DNA sequence or the cDNA 
sequences. The plasmid is linearized and transcribed in the presence of ribonucleotides 
comprising modified ribonucleotides (i.e. biotin-UTP and DIG-UTP). An excess of this doubly 
labeled RNA is hybridized in solution with mRNA isolated from cells or tissues of interest. The 
hybridization is performed under standard stringent conditions (40-50°C for 16 hours in an 80% 
formamide, 0. 4 M NaCI buffer, pH 7-8). The unhybridized probe is removed by digestion 
with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, Tl, Phy M, U2 or A). 
The presence of the biotin-UTP modification enables capture of the hybrid on a microtitration 
plate coated with streptavidin. The presence of the DIG modification enables the hybrid to be 
detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline phosphatase. 

Quantitative analysis of g35030 gene expression may also be performed using arrays. 
As used herein, the term array means a one dimensional, two dimensional, or multidimensional 
arrangement of a plurality of nucleic acids of sufficient length to permit specific detection of 
expression of mRNAs capable of hybridizing thereto. For example, the arrays may contain a 
plurality of nucleic acids derived from genes whose expression levels are to be assessed. The 
arrays may include the g35030 genomic DNA, the g35030 cDNA sequences or the sequences 
complementary thereto or fragments thereof, particularly those comprising at least one of the 
biallelic markers according the present invention. Preferably, the fragments are at least 15 
nucleotides in length. In other embodiments, the fragments are at least 25 nucleotides in length. 
In some embodiments, the fragments are at least 50 nucleotides in length. More preferably, the 
fragments are at least 100 nucleotides in length. In another preferred embodiment, the 
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fragments are more than 100 nucleotides in length. In some embodiments the fragments may be 
more than 500 nucleotides in length. 

For example, quantitative analysis of g35030 gene expression may be performed with a 
complementary DNA microarray as described by Schena et al.(I995 and 1996). Full length 
g35030 cDNAs or fragments thereof are amplified by PCR and arrayed from a 96-well 
microtiter plate onto silylated microscope slides using high-speed robotics. Printed arrays are 
incubated in a humid chamber to allow rehydration of the array elements and rinsed, once in 0. 
2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride solution. 
The arrays are submerged in water for 2 min at 95°C, transferred into 0. 2% SDS for 1 min, 
rinsed twice with water, air dried and stored in the dark at 25°C. 

Cell or tissue mRNA is isolated or commercially obtained and probes are prepared by a 
single round of reverse transcription. Probes are hybridized to 1 cm 2 microarrays under a 14 x 
14 mm glass coverslip for 6-12 hours at 60°C. Arrays are washed for 5 min at 25°C in low 
stringency wash buffer (1 x SSC/0. 2% SDS), then for 10 min at room temperature in high 
stringency wash buffer (0. lx SSC/0. 2% SDS). Arrays are scanned in 0. lxSSCusinga 
fluorescence laser scanning device fitted with a custom filter set. Accurate differential 
expression measurements are obtained by taking the average of the ratios of two independent 
hybridizations. 

Quantitative analysis of g35030 gene expression may also be performed with full length 
g35030 cDNAs or fragments thereof in complementary DNA arrays as described by Pietu et 
al.(1996). The full length g35030 cDNA or fragments thereof is PCR amplified and spotted on 
membranes. Then, mRNAs originating from various tissues or cells are labeled with 
radioactive nucleotides. After hybridization and washing in controlled conditions, the 
hybridized mRNAs are detected by phospho-imaging or autoradiography. Duplicate 
experiments are performed and a quantitative analysis of differentially expressed mRNAs is 
then performed. 

Alternatively, expression analysis using the g35030 genomic DNA, the g35030 cDNA, 
or fragments thereof can be done through high density nucleotide arrays as described by 
Lockhart et al.(1996) and Sosnowsky et al.(1997). Oligonucleotides of 15-50 nucleotides from 
the sequences of the g35030 genomic DNA, the g35030 cDNA sequences particularly those 
comprising at least one of biallelic markers according the present invention, or the sequences 
complementary thereto, are synthesized directly on the chip (Lockhart et al., supra) or 
synthesized and then addressed to the chip (Sosnowski et ah, supra). Preferably, the 
oligonucleotides are about 20 nucleotides in length. 

g35030 cDNA probes labeled with an appropriate compound, such as biotin, 
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digoxigenin or fluorescent dye, are synthesized from the appropriate mRNA population and 
then randomly fragmented to an average size of 50 to 100 nucleotides. The said probes are then 
hybridized to the chip. After washing as described in Lockhart et al., supra and application of 
different electric fields (Sosnowsky et al., 1997)., the dyes or labeling compounds are detected 
and quantified. Duplicate hybridizations are performed. Comparative analysis of the intensity 
of the signal originating from cDNA probes on the same target oligonucleotide in different 
cDNA samples indicates a differential expression of g35030 mRNA. 

Methods For Inhibiting The Expression Of A g35030 Gene 

Other therapeutic compositions according to the present invention comprise 
advantageously an oligonucleotide fragment of the nucleic sequence of g35030 as an antisense 
tool or a triple helix tool that inhibits the expression of the corresponding g35030 gene. A 
preferred fragment of the nucleic sequence of g35030 comprises an allele of at least one of the 
biallelic markers of the invention. 

Antisense Approach 

Preferred methods using antisense polynucleotide according to the present invention are 
the procedures described by Sczakiel et al.(1995). 

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) 
that are complementary to the 5'end of the g35030 mRNA. In another embodiment, a 
combination of different antisense polynucleotides complementary to different parts of the 
desired targeted gene are used. 

Preferred antisense polynucleotides according to the present invention are 
complementary to a sequence of the mRNAs of g35030 that contains either the translation 
initiation codon ATG or a splicing donor or acceptor site. 

The antisense nucleic acids should have a length and melting temperature sufficient to 
permit formation of an intracellular duplex having sufficient stability to inhibit the expression of 
the g35030 mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for 
use in gene therapy are disclosed in Green et al., (1986) and Izant and Weintraub, (1984). 

In some strategies, antisense molecules are obtained by reversing the orientation of the 
g35030 coding region with respect to a promoter so as to transcribe the opposite strand from 
that which is normally transcribed in the cell. The antisense molecules may be transcribed 
using in vitro transcription systems such as those which employ T7 or SP6 polymerase to 
generate the transcript. Another approach involves transcription of g35030 antisense nucleic 
acids in vivo by operably linking DNA containing the antisense sequence to a promoter in a 
suitable expression vector. 
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Alternatively, suitable antisense strategies are those described by Rossi et al.(1991), in 
the International Applications Nos. WO 94/23026, WO 95/04141, WO 92/18522 and in the 
European Patent Application No. EP 0 572 287 A2 

An alternative to the antisense technology that is used according to the present 
invention comprises using ribozymes that will bind to a target sequence via their 
complementary polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing 
its target site (namely "hammerhead ribozymes"). Briefly, the simplified cycle of a 
hammerhead ribozyme comprises (1) sequence specific binding to the target RNA via 
complementary antisense sequences; (2) site-specific hydrolysis of the cleavable motif of the 
target strand; and (3) release of cleavage products, which gives rise to another catalytic cycle. 
Indeed, the use of long-chain antisense polynucleotide (at least 30 bases long) or ribozymes 
with long antisense arms are advantageous. A preferred delivery system for antisense ribozyme 
is achieved by covalently linking these antisense ribozymes to lipophilic groups or to use 
liposomes as a convenient vector. Preferred antisense ribozymes according to the present 
invention are prepared as described by Sczakiel et al.(1995). 

Triple Helix Approach 

The g35030 genomic DNA may also be used to inhibit the expression of the g35030 
gene based on intracellular triple helix formation. 

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are 
particularly useful for studying alterations in cell activity when it is associated with a particular 
gene. 

Similarly, a portion of the g35030 genomic DNA can be used to study the effect of 
inhibiting g35030 transcription within a cell. Traditionally, homopurine sequences were 
considered the most useful for triple helix strategies. However, homopyrimidine sequences can 
also inhibit gene expression. Such homopyrimidine oligonucleotides bind to the major groove 
at homopurine:homopyrimidine sequences. Thus, both types of sequences from the g35030 
genomic DNA are contemplated within the scope of this invention. 

To cany out gene therapy strategies using the triple helix approach, the sequences of 
the g35030 genomic DNA are first scanned to identify 10-mer to 20-mer homopyrimidine or 
homopurine stretches which could be used in triple-helix based strategies for inhibiting g35030 
expression. Following identification of candidate homopyrimidine or homopurine stretches, 
their efficiency in inhibiting g35030 expression is assessed by introducing varying amounts of 
oligonucleotides containing the candidate sequences into tissue culture cells which express the 
g35030 gene. 
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The oligonucleotides can be introduced into the cells using a variety of methods known 
to those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE- 
Dextran, electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for altered cell function or reduced g35030 expression using 
techniques such as Northern blotting, RNase protection assays, or PCR based strategies to 
monitor the transcription levels of the g35030 gene in cells which have been treated with the 
oligonucleotide. 

The oligonucleotides which are effective in inhibiting gene expression in tissue culture 
cells may then be introduced in vivo using the techniques described above in the antisense 
approach at a dosage calculated based on the in vitro results, as described in antisense approach. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, 
an intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the 
alpha oligonucleotide to stabilize the triple helix. For information on the generation of 
oligonucleotides suitable for triple helix formation see Griffin et al.(1989). 

Pharmaceutical Compositions And Formulations 

G35030-modulating Compounds 

Using the methods disclosed herein, compounds that selectively modulate g35030 activity 
in vitro and in vivo may be identified. The compounds identified by the process of the invention 
include, for example, antibodies having binding specificity for the g35030 peptide. It is also 
expected that homologues of g35030 may be useful for modulating g35030-mediated activity and 
the related physiological condition associated with schizophrenia or bipolar disorder. Generally, it 
is further expected that assay methods of the present invention based on the role of g35030 in 
central nervous system disorder may be used to identify compounds capable of intervening in the 
assay cascade of the invention. 

Indications 

While g3503O has demonstrated an association with schizophrenia and bipolar disorder, 
indications involving g35030 may include various central nervous system disorders. Nervous 
system disorders are expected to have complex genetic bases and often share certain symptoms. In 
particular, as described herein, indications may include schizophrenia and other psychotic 
disorders, mood disorders, autism, substance dependence and alcoholism, mental retardation, and 
other psychiatric diseases including cognitive, anxiety, eating, impulse-control, and personality 
disorders, as defined with the Diagnosis and Statistical Manual of Mental Disorders fourth edition 
(DSM-IV) classification. 
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Pharmaceutical Formulations and Routes of Administration 

The compounds identified using the methods of the present invention can be administered 
to a mammal, including a human patient, alone or in pharmaceutical compositions where they are 
mixed with suitable carriers or excipient(s) at therapeutically effective doses to treat or ameliorate 
schizophrenia or bipolar disorder related disorders. A therapeutically effective dose further refers 
to that amount of the compound sufficient to result in amelioration of symptoms as determined by 
the methods described herein. Preferably, a therapeutically effective dosage is suitable for 
continued periodic use or administration. Techniques for formulation and administration of the 
compounds of the instant application may be found in "Remington's Pharmaceutical Sciences," 
Mack Publishing Co., Easton, PA, latest edition. 

Routes of Administration 

Suitable routes of administration include oral, rectal, transmucosal, or intestinal 
administration, parenteral delivery, including intramuscular, subcutaneous, intramedullary 
injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal or 
intraocular injections. A particularly useful method of administering compounds for treating 
central nervous system disease involves surgical implantation of a device for delivering the 
compound over an extended period of time. Sustained release formulations of the invented 
medicaments particularly are contemplated. 

Composition/Formulation 

Pharmaceutical compositions and medicaments for use in accordance with the present 
invention may be formulated in a conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries. Proper formulation is dependent upon 
the route of administration chosen. 

For injection, the agents of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 
physiological saline buffer such as a phosphate or bicarbonate buffer. For transmucosal 
administration, penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 
sorbitol. The push-fit capsules can contain the active ingredients in admixture with fillers such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
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for such administration. 

For buccal administration,the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from pressurized 
packs or a nebulizer, with the use of a suitable gaseous propellant, e.g., carbon dioxide. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a 
metered amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or insufflator, may 
be formulated containing a powder mix of the compound and a suitable powder base such as 
lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by 
bolus injection or continuous infusion. Formulations for injection may be presented in unit 
dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in aqueous vehicles, 
and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Aqueous suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase 
the solubility of the compounds to allow for the preparation of highly concentrated solutions. 

Alternatively, the active ingredient may be in powder or lyophilized form for constitution 
with a suitable vehicle, such as sterile pyrogen-free water, before use. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble salt. 

Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent Various 
sustained release materials have been established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical nature, release the compounds for a 
few weeks up to over 100 days. 

Depending on the chemical nature and the biological stability of the therapeutic reagent, 
additional strategies for protein stabilization may be employed. 
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The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 
or excipients. Examples of such earners or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. 

Effective Dosage 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve their 
intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amounts is well within the capability of those skilled in the 
art, especially in light of the detailed disclosure provided herein. 

For any compound used in the method of the invention, the therapeutically effective dose 
can be estimated initially from cell culture assays, and a dose can be formulated in animal models. 
Such information can be used to more accurately determine useful doses in humans. 

A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms in a patient. Toxicity and therapeutic efficacy of such compounds can 
be determined by standard pharmaceutical procedures in cell cultures or experimental animals, 
e.g., for determining the LD50, (the dose lethal to 50% of the test population) and the ED50 (the 
dose therapeutically effective in 50% of the population). The dose ratio between toxic and 
therapeutic effects is the therapeutic index and it can be expressed as the ratio between LD50 and 
ED50. Compounds which exhibit high therapeutic indices are preferred. 

The data obtained from these cell culture assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the ED50, with little or no toxicity. The 
dosage may vary within this range depending upon the dosage form employed and the route of 
administration utilized. The exact formulation, route of administration and dosage can be chosen 
by the individual physician in view of the patient's condition. (See, e.g., FingI et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1). 

Computer-Related Embodiments 

As used herein the term "nucleic acid codes of the invention" encompass the nucleotide 
sequences comprising, consisting essentially of, or consisting of any one of the following: 

a) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 
150, 200, 500, 1000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, 
wherein said contiguous span comprises at least one of the following nucleotide positions of 
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SEQ ID No 1: 201 123 to 247802 and 199122 to 249803; 

b) a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 
75, 80, 90, 100 or 200 nucleotides, to the extent that such a length is consistent with the 
particular sequence ID, of SEQ ID Nos. 2 to 17 or the complements thereof; 

c) a contiguous span of at least 12, 15, 18, 20, 25,30,35,40, 50, 60, 70, 80, 90, 100, 
150, 200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 24, 25, 28 and 74, and the 
complements thereof, to the extent that such a length is consistent with the particular sequence 
ID; 

d) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 
150, 200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 79 to 132, and the complements 
thereof, to the extent that such a length is consistent with the particular sequence ID; 

e) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
span comprises at least one of the following nucleotide positions of SEQ ID No 1 : 201 123 to 
201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 199122 to 201 122 
and 247803 to 249803; 

f) a contiguous span according to a), b), c), d) or e), wherein said span includes a 
biallelic marker; 

g) a contiguous span according to a), b), c), d) or e), wherein said span includes a 
biallelic marker selected from the group consisting of A13 to A18, A20 to A46, A49 to A52, 
A55, A57, A59 to A63, A72 to A73, A76 and A 123 ; and 

h) a contiguous span according to f), wherein said span includes a biallelic marker with 
the alternative allele present at said biallelic marker; 

The "nucleic acid codes of the invention" further encompass nucleotide sequences 
homologous to a contiguous span of at least 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 
1000 or 2000 nucleotides, to the extent that such a length is consistent with the particular 
sequence of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132, and the complements thereof. 
The "nucleic acid codes of the invention" also encompass nucleotide sequences homologous to 
a contiguous span of at least 12, 1 5, 1 8, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 90 or 
100 nucleotides of SEQ ID No. 1 or the complements thereof, wherein said contiguous span 
comprises at least one of the following nucleotide positions of SEQ ID No. 1 : 201 123 to 
201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
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240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 201 123 to 
247802, 199122 to 201122, 247803 to 249803 and 199122 to 249803. 

Homologous sequences refer to a sequence having at least 99%, 98%, 97%, 96%, 
95%, 90%, 85%, 80%, or 75% homology to these contiguous spans. Homology may be 
determined using any method described herein, including BLAST2N with the default 
parameters or with any modified parameters. Homologous sequences also may include RNA 
sequences in which uridines replace the thymines in the nucleic acid codes of the invention. It 
will be appreciated that the nucleic acid codes of the invention can be represented in the 
traditional single character format (See the inside back cover of Stryer, Lubert. Biochemistry, 

3 rc * edition. W. H Freeman & Co., New York.) or in any other format or code which records 
the identity of the nucleotides in a sequence. 

As used herein the term "polypeptide codes of SEQ ID Nos. 1 8 to 23" encompasses the 
polypeptide sequence of SEQ ID Nos 1 8 to 23, polypeptide sequences homologous to the 
polypeptides of SEQ ID Nos. 1 8 to 23, or fragments of any of the preceding sequences. 
Homologous polypeptide sequences refer to a polypeptide sequence having at least 99%, 98%, 
97%, 96%, 95%, 90%, 85%, 80%, 75% homology to one of the polypeptide sequences of SEQ ID 
Nos. 1 8 to 23. Homology may be determined using any of the computer programs and parameters 
described herein, including FASTA with the default parameters or with any modified parameters. 
The homologous sequences may be obtained using any of the procedures described herein or may 
result from the correction of a sequencing error as described above. The polypeptide fragments 
comprise at least 4, 6, 8, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of 
the polypeptides of SEQ ID Nos. 18 to 23. Preferably, the fragments are novel fragments. It will be 
appreciated that the polypeptide codes of the SEQ ID Nos. 18 to 23 can be represented in the 
traditional single character format or three letter format (See the inside back cover of Starrier, 
Lubert. Biochemistry, 3 rd edition. W. H Freeman & Co., New York.) or in any other format which 
relates the identity of the polypeptides in a sequence. 

It will be appreciated by those skilled in the art that the nucleic acid codes of SEQ ID 
Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 and polypeptide codes of SEQ ID Nos. 1 8 to 23 can 
be stored, recorded, and manipulated on any medium which can be read and accessed by a 
computer. As used herein, the words "recorded" and "stored" refer to a process for storing 
information on a computer medium. A skilled artisan can readily adopt any of the presently known 
methods for recording information on a computer readable medium to generate embodiment 
comprising one or more of nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
1 32, or one or more of the polypeptide codes of SEQ ID Nos. 1 8 to 23. Another aspect of the 
present invention is a computer readable medium having recorded thereon at least 2, 5, 10, 15, 20, 
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25, 30, or 50 nucleic acid codes of SEQ ID Nos I to 17, 24, 25, 28, 74 and 79 to 132. Another 
aspect of the present invention is a computer readable medium having recorded thereon at least 2, 
5, 10, 1 5, 20, 25, 30, or 50 polypeptide codes of SEQ ID Nos 1 8 to 23. 

Computer readable media include magnetically readable media, optically readable media, 
electronically readable media and magnetic/optical media. For example, the computer readable 
media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), 
Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of other 
media known to those skilled in the art. 

Embodiments of the present invention include systems, particularly computer systems 
which store and manipulate the sequence information described herein. One example of a 
computer system 1 00 is illustrated in block diagram form in Figure 12. As used herein, "a 
computer system" refers to the hardware components, software components, and data storage 
components used to analyze the nucleotide sequences of the nucleic acid codes of SEQ ID Nos 1 
to 1 7, 24, 25, 28, 74 and 79 to 1 32, or the amino acid sequences of the polypeptide codes of SEQ 
ID Nos. 1 8 to 23. In one embodiment, the computer system 100 is a Sun Enterprise 1000 server 
(Sun Microsystems, Palo Alto, CA). The computer system 100 preferably includes a processor for 
processing, accessing and manipulating the sequence data. The processor 1 05 can be any well- 
known type of central processing unit, such as the Pentium HI from Intel Corporation, or similar 
processor from Sun, Motorola, Compaq or International Business Machines. 

Preferably, the computer system 100 is a general purpose system that comprises the 
processor 105 and one or more internal data storage components 1 10 for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage components. A 
skilled artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 

In one particular embodiment, the computer system 100 includes a processor 105 
connected to a bus which is connected to a main memory 1 15 (preferably implemented as RAM) 
and one or more internal data storage devices 1 10, such as a hard drive and/or other computer 
readable media having data recorded thereon. In some embodiments, the computer system 1 00 
further includes one or more data retrieving device 1 1 8 for reading the data stored on the internal 
data storage devices 1 10. 

The data retrieving device 1 1 8 may represent, for example, a floppy disk drive, a compact 
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 1 10 is 
a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc. 
containing control logic and/or data recorded thereon. The computer system 100 may 
advantageously include or be programmed by appropriate software for reading the control logic 
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and/or the data from the data storage component once inserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display output to a 
computer user. It should also be noted that the computer system 1 00 can be linked to other 
computer systems 125a-c in a network or wide area network to provide centralized access to the 
computer system 100. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes 
of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132, or the amino acid sequences of the 
polypeptide codes of SEQ ID Nos. 1 8 to 23 (such as search tools, compare tools, and modeling 
tools etc.) may reside in main memory 1 15 during execution. 

In some embodiments, the computer system 100 may further comprise a sequence 
comparer for comparing the above-described nucleic acid codes of SEQ ID Nos. I to 1 7, 24, 25, 
28, 74 and 79 to 1 32 or polypeptide codes of SEQ ID Nos. 1 8 to 23 stored on a computer readable 
medium to reference nucleotide or polypeptide sequences stored on a computer readable medium. 
A "sequence comparer" refers to one or more programs which are implemented on the computer 
system 100 to compare a nucleotide or polypeptide sequence with other nucleotide or polypeptide 
sequences and/or compounds including but not limited to peptides, peptidomimetics, and chemicals 
stored within the data storage means. For example, the sequence comparer may compare the 
nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
132, or the amino acid sequences of the polypeptide codes of SEQ ID Nos.'l 8 to 23 stored on a 
computer readable medium to reference sequences stored on a computer readable medium to 
identify homologies, motifs implicated in biological function, or structural motifs. The various 
sequence comparer programs identified elsewhere in this patent specification are particularly 
contemplated for use in this aspect of the invention. 

Figure 13 is a flow diagram illustrating one embodiment of a process 200 for comparing a 
new nucleotide or protein sequence with a database of sequences in order to determine the 
homology levels between the new sequence and the sequences in the database. The database of 
sequences can be a private database stored within the computer system 100, or a public database 
such as GENBANK, PIR OR SWISSPROT that is available through the Internet. 

The process 200 begins at a start state 201 and then moves to a state 202 wherein the new 
sequence to be compared is stored to a memory in a computer system 100. As discussed above, the 
memory could be any type of memory, including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is opened for 
analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence 
stored in the database is read into a memory on the computer. A comparison is then performed at a 
state 210 to determine if the first sequence is the same as the second sequence. It is important to 
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note that this step is not limited to performing an exact comparison between the new sequence and 
the first sequence in the database. Well-known methods are known to those of skill in the art for 
comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps 
can be introduced into one sequence in order to raise the homology level between the two tested 
sequences. The parameters that control whether gaps or other features are introduced into a 
sequence during comparison are normally entered by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 210, a 
determination is made at a decision state 210 whether the two sequences are the same. Of course, 
the term "same" is not limited to sequences that are absolutely identical. Sequences that are within 
the homology parameters entered by the user will be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 200 moves to a 
state 214 wherein the name of the sequence from the database is displayed to the user. This state 
notifies the user that the sequence with the displayed name fulfills the homology constraints that 
were entered. Once the name of the stored sequence is displayed to the user, the process 200 
moves to a decision state 218 wherein a determination is made whether more sequences exist in the 
database. If no more sequences exist in the database, then the process 200 terminates at an end 
state 220. However, if more sequences do exist in the database, then the process 200 moves to a 
state 224 wherein a pointer is moved to the next sequence in the database so that it can be 
compared to the new sequence. In this manner, the new sequence is aligned and compared with 
every sequence in the database. 

It should be noted that if a determination had been made at the decision state 212 that the 
sequences were not homologous, then the process 200 would move immediately to the decision 
state 2 1 8 in order to determine if any other sequences were available in the database for 
comparison. 

Accordingly, one aspect of the present invention is a computer system comprising a 
processor, a data storage device having stored thereon a nucleic acid code of SEQ ID NOs. 1 to 
1 7, 24, 25, 28, 74 and 79 to 132 or a polypeptide code of SEQ ID Nos 1 8 to 23, a data storage 
device having retrievably stored thereon reference nucleotide sequences or polypeptide 
sequences to be compared to the nucleic acid code of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 
79 to 132 or polypeptide code of SEQ ID Nos. 1 8 to 23 and a sequence comparer for conducting 
the comparison. The sequence comparer may indicate a homology level between the sequences 
compared or identify structural motifs in the above described nucleic acid code of SEQ ID Nos. 
1 to 1 7, 24, 25, 28, 74 and 79 to 132 and polypeptide codes of SEQ ID Nos. 1 8 to 23, or it may 
identify structural motifs in sequences which are compared to these nucleic acid codes and 
polypeptide codes. In some embodiments, the data storage device may have stored thereon the 
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sequences of at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of SEQ ID Nos. 1 to 
1 7, 24, 25, 28, 74 and 79 to 132 or polypeptide codes of SEQ ID Nos. 1 8 to 23. 

Another aspect of the present invention is a method for determining the level of homology 
between a nucleic acid code of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 and a reference 
nucleotide sequence, comprising the steps of reading the nucleic acid code and the reference 
nucleotide sequence through the use of a computer program which determines homology levels and 
determining homology between the nucleic acid code and the reference nucleotide sequence with 
the computer program. The computer program may be any of a number of computer programs for 
determining homology levels, including those specifically enumerated herein, including BLAST2N 
with the default parameters or with any modified parameters. The method may be implemented 
using the computer systems described above. The method may also be performed by reading 2, 5, 
10, 15, 20, 25, 30, or 50 of the above described nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 132 through use of the computer program and determining homology 
between the nucleic acid codes and reference nucleotide sequences . 

Figure 14 is a flow diagram illustrating one embodiment of a process 250 in a computer 
for determining whether two sequences are homologous. The process 250 begins at a start state 
252 and then moves to a state 254 wherein a first sequence to be compared is stored to a 
memory. The second sequence to be compared is then stored to a memory at a state 256. The 
process 250 then moves to a state 260 wherein the first character in the first sequence is read 
and then to a state 262 wherein the first character of the second sequence is read. It should be 
understood that if the sequence is a nucleotide sequence, then the character would normally be 
either A, T, C, G or U. If the sequence is a protein sequence, then it should be in the single 
letter amino acid code so that the first and sequence sequences can be easily compared. 

A determination is then made at a decision state 264 whether the two characters are the 
same. If they are the same, then the process 250 moves to a state 268 wherein the next 
characters in the first and second sequences are read. A determination is then made whether 
the next characters are the same. If they are, then the process 250 continues this loop until two 
characters are not the same. If a determination is made that the next two characters are not the 
same, the process 250 moves to a decision state 274 to determine whether there are any more 
characters either sequence to read. 

If there aren't any more characters to read, then the process 250 moves to a state 276 
wherein the level of homology between the first and second sequences is displayed to the user. 
The level of homology is determined by calculating the proportion of characters between the 
sequences that were the same out of the total number of sequences in the first sequence. Thus, 
if every character in a first 100 nucleotide sequence aligned with a every character in a second 
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sequence, the homology level would be 100%. 

Alternatively, the computer program may be a computer program which compares the 
nucleotide sequences of the nucleic acid codes of the present invention, to reference nucleotide 
sequences in order to determine whether the nucleic acid code of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 
74 and 79 to 1 32 differs from a reference nucleic acid sequence at one or more positions. 
Optionally such a program records the length and identity of inserted, deleted or substituted 
nucleotides with respect to the sequence of either the reference polynucleotide or the nucleic acid 
code of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132. In one embodiment, the computer 
program may be a program which determines whether the nucleotide sequences of the nucleic acid 
codes of SEQ ID Nos. 1 to 17,24, 25,28, 74 and 79 to 132 contain a biallelic marker or single 
nucleotide polymorphism (SNP) with respect to a reference nucleotide sequence. This single 
nucleotide polymorphism may comprise a single base substitution, insertion, or deletion, while 
this biallelic marker may comprise abour one to ten consecutive bases substituted, inserted or 
deleted. 

Another aspect of the present invention is a method for determining the level of homology 
between a polypeptide code of SEQ ID Nos. 18 to 23 and a reference polypeptide sequence, 
comprising the steps of reading the polypeptide code of SEQ ID Nos. 1 8 to 23 and the reference 
polypeptide sequence through use of a computer program which determines homology levels and 
determining homology between the polypeptide code and the reference polypeptide sequence using 
the computer program. 

Accordingly, another aspect of the present invention is a method for determining whether a 
nucleic acid code of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 differs at one or more 
nucleotides from a reference nucleotide sequence comprising the steps of reading the nucleic acid 
code and the reference nucleotide sequence through use of a computer program which identifies 
differences between nucleic acid sequences and identifying differences between the nucleic acid 
code and the reference nucleotide sequence with the computer program. In some embodiments, the 
computer program is a program which identifies single nucleotide polymorphisms. The method 
may be implemented by the computer systems described above and the method illustrated in Figure 
14. The method may also be performed by reading at least 2, 5, 10, 15, 20, 25, 30, or 50 of the 
nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 and the reference 
nucleotide sequences through the use of the computer program and identifying differences between 
the nucleic acid codes and the reference nucleotide sequences with the computer program. 

In other embodiments the computer based system may further comprise an identifier for 
identifying features within the nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 
1 7, 24, 25, 28, 74 and 79 to 132 or the amino acid sequences of the polypeptide codes of SEQ ID 
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Nos. 18 to 23. 

An "identifier" refers to one or more programs which identifies certain features within 
the above-described nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 132 or the amino acid sequences of the polypeptide codes of SEQ ID Nos. 
1 8 to 23. In one embodiment, the identifier may comprise a program which identifies an open 
reading frame in the cDNAs codes of SEQ ID Nos 2 to 1 7 and 36 to 40. 

Figure 15 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 
and then moves to a state 304 wherein a first sequence that is to be checked for features is 
stored to a memory 1 15 in the computer system 100. The process 300 then moves to a state 306 
wherein a database of sequence features is opened. Such a database would include a list of each 
feature's attributes along with the name of the feature. For example, a feature name could be 
"Initiation Codon" and the attribute would be "ATG". Another example would be the feature 
name "TAATAA Box" and the feature attribute would be "TAATAA". An example of such a 
database is produced by the University of Wisconsin Genetics Computer Group 
(www.gcg.com). 

Once the database of features is opened at the state 306, the process 300 moves to a 
state 308 wherein the first feature is read from the database. A comparison of the attribute of 
the first feature with the first sequence is then made at a state 3 10. A determination is then 
made at a decision state 316 whether the attribute of the feature was found in the first sequence. 
If the attribute was found, then the process 300 moves to a state 3 1 8 wherein the name of the 
found feature is displayed to the user. 

The process 300 then moves to a decision state 320 wherein a determination is made 
whether move features exist in the database. If no more features do exist, then the process 300 
terminates at an end state 324. However, if more features do exist in the database, then the 
process 300 reads the next sequence feature at a state 326 and loops back to the state 310 
wherein the attribute of the next feature is compared against the first sequence. 

It should be noted, that if the feature attribute is not found in the first sequence at the 
decision state 3 1 6, the process 300 moves directly to the decision state 320 in order to 
determine if any more features exist in the database. 

In another embodiment, the identifier may comprise a molecular modeling program 
which determines the 3-dimensional structure of the polypeptides codes of SEQ ID Nos. 18 to 
23. In some embodiments, the molecular modeling program identifies target sequences that are 
most compatible with profiles representing the structural environments of the residues in known 
three-dimensional protein structures. (See, e.g., Eisenberg et al., U.S. Patent No. 5,436,850 
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issued July 25, 1995). In another technique, the known three-dimensional structures of proteins 
in a given family are superimposed to define the structurally conserved regions in that family. 
This protein modeling technique also uses the known three-dimensional structure of a 
homologous protein to approximate the structure of the polypeptide codes of SEQ ID Nos. 4 to 
8. (See e.g., Srinivasan, et al., U.S. Patent No. 5,557,535 issued September 17, 1996). 
Conventional homology modeling techniques have been used routinely to build models of 
proteases and antibodies. (Sowdhamini et al., Protein Engineering 10:207, 215 (1997)). 
Comparative approaches can also be used to develop three-dimensional protein models when 
the protein of interest has poor sequence identity to template proteins. In some cases, proteins 
fold into similar three-dimensional structures despite having very weak sequence identities. For 
example, the three-dimensional structures of a number of helical cytokines fold in similar three- 
dimensional topology in spite of weak sequence homology. 

The recent development of threading methods now enables the identification of likely 
folding patterns in a number of situations where the structural relatedness between target and 
template(s) is not detectable at the sequence level. Hybrid methods, in which fold recognition is 
performed using Multiple Sequence Threading (MST), structural equivalencies are deduced 
from the threading output using a distance geometry program DRAGON to construct a low 
resolution model, and a full-atom representation is constructed using a molecular modeling 
package such as QUANTA. 

According to this 3-step approach, candidate templates are first identified by using the 
novel fold recognition algorithm MST, which is capable of performing simultaneous threading 
of multiple aligned sequences onto one or more 3-D structures. In a second step, the structural 
equivalencies obtained from the MST output are converted into interresidue distance restraints 
and fed into the distance geometry program DRAGON, together with auxiliary information 
obtained from secondary structure predictions. The program combines the restraints in an 
unbiased manner and rapidly generates a large number of low resolution model confirmations. 
In a third step, these low resolution model confirmations are converted into full-atom models 
and subjected to energy minimization using the molecular modeling package QUANTA. (See 
e.g., Aszodi et al., Proteins: Structure, Function, and Genetics, Supplement 1:38-42 (1997)). 

The results of the molecular modeling analysis may then be used in rational drug design 
techniques to identify agents which modulate the activity of the polypeptide codes of SEQ ID 
Nos. 18 to 23. 

Accordingly, another aspect of the present invention is a method of identifying a feature 
within the nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 or the 
polypeptide codes of SEQ ID Nos. 1 8 to 23 comprising reading the nucleic acid code(s) or the 
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polypeptide code(s) through the use of a computer program which identifies features therein and 
identifying features within the nucleic acid code(s) or polypeptide code(s) with the computer 
program. In one embodiment, computer program comprises a computer program which 
identifies open reading frames. In a further embodiment, the computer program identifies 
5 structural motifs in a polypeptide sequence. In another embodiment, the computer program 

comprises a molecular modeling program. The method may be performed by reading a single 
sequence or at least 2, 5, 10, 15, 20,25,30, or 50 of the nucleic acid codes of SEQ ID Nos. 1 to 
17, 24, 25, 28, 74 and 79 to 132 or the polypeptide codes of SEQ ID Nos. 18 to 23 through the 
use of the computer program and identifying features within the nucleic acid codes or 
1 0 polypeptide codes with the computer program. 

The nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the 
polypeptide codes of SEQ ID Nos. 18 to 23 may be stored and manipulated in a variety of data 
processor programs in a variety of formats. For example, the nucleic acid codes of SEQ ID Nos. 1 
to 17, 24, 25, 28, 74 and 79 to 132 or the polypeptide codes of SEQ ID Nos. 18 to 23 may be 
1 5 stored as text in a word processing file, such as MicrosoftWORD or WORDPERFECT or as an 

ASCII file in a variety of database programs familiar to those of skill in the art, such as DB2, 
SYBASE, or ORACLE. In addition, many computer programs and databases may be used as 
sequence comparers, identifiers, or sources of reference nucleotide or polypeptide sequences to be 
compared to the nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the 
20 polypeptide codes of SEQ ID Nos. 18 to 23. The following list is intended not to limit the 

invention but to provide guidance to programs and databases which are useful with the nucleic acid 
codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the polypeptide codes of SEQ ID 
Nos. 18 to 23. The programs and databases which may be used include, but are not limited to: 
MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine (Molecular 
25 Applications Group), Look (Molecular Applications Group), MacLook (Molecular Applications 

Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al, J. Mol Biol 215: 
403 (1990)), FASTA (Pearson and Lipman, Proa Natl Acad Set. USA, 85: 2444 (1988)), 
FASTDB (Brutlag et al. Comp. App. Biosci. 6:237-245, 1990), Catalyst (Molecular Simulations 
Inc.), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius 2 .DBAccess (Molecular Simulations 
30 Inc.), HypoGen (Molecular Simulations Inc.), Insight II, (Molecular Simulations Inc.), Discover 

(Molecular Simulations Inc.), CHARMm (Molecular Simulations Inc.), Felix (Molecular 
Simulations Inc.), DelPhi, (Molecular Simulations Inc.), QuanteMM, (Molecular Simulations Inc.), 
Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular 
Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.), WebLab (Molecular 
35 Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer 
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(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein 
database, the MDL Available Chemicals Directory database, the MDL Drug Data Report data base, 
the Comprehensive Medicinal Chemistry database, Derwents's World Drug Index database, the 
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other 
programs and data bases would be apparent to one of skill in the art given the present disclosure. 

Motifs which may be detected using the above programs include sequences encoding 
leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, 
and beta sheets, signal sequences encoding signal peptides which direct the secretion of the 
encoded proteins, sequences implicated in transcription regulation such as homeoboxes, acidic 
stretches, enzymatic active sites, substrate binding sites, and enzymatic cleavage sites. 

Throughout this application, various publications, patents, and published patent 
applications are cited. The disclosures of the publications, patents, and published patent 
specifications referenced in this application are all hereby incorporated by reference in their 
entireties into the present disclosure to more fully describe the state of the art to which this 
invention pertains. 

EXAMPLES 

Several of the methods of the present invention are described in the following 
examples, which are offered by way of illustration and not by way of limitation. Many other 
modifications and variations of the invention as herein set forth can be made without departing 
from the spirit and scope thereof and therefore only such limitations should be imposed as are 
indicated by the appended claims. 

Example 1 

Identification Of Biallelic Markers - DNA Extraction 

Donors were unrelated and healthy. They presented a sufficient diversity for being 
representative of a heterogeneous population. The DNA from 100 individuals was extracted 
and tested for the detection of the biallelic markers. 

30 ml of peripheral venous blood were taken from each donor in the presence of EDTA. 
Cells (pellet) were collected after centrifugation for 10 minutes at 2000 rpm. Red cells were 
lysed by a lysis solution (50 ml final volume: 10 mM Tris pH7.6; 5 mM MgCl2; 10 mM NaCl). 
The solution was centrifuged (10 minutes, 2000 rpm) as many times as necessary to eliminate 
the residual red cells present in the supernatant, after resuspension of the pellet in the lysis 
solution. 

The pellet of white cells was lysed overnight at 42°C with 3.7 ml of lysis solution 
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composed of: 

- 3 ml TE 10-2 (Tris-HCl 1 0 mM, EDTA 2 mM) / NaCI 0 4 M 
-200^1 SDS 10% 

- 500 \i\ K-proteinase (2 mg K-proteinase in TE 10-2 / NaCI 0.4 M). 

5 

For the extraction of proteins, 1 ml saturated NaCI (6M) (1/3.5 v/v) was added. After 
vigorous agitation, the solution was centrifuged for 20 minutes at 10000 rpm. 

For the precipitation of DNA, 2 to 3 volumes of 100% ethanol were added to the 
previous supernatant, and the solution was centrifuged for 30 minutes at 2000 rpm. The DNA 
10 solution was rinsed three times with 70% ethanol to eliminate salts, and centrifuged for 20 

minutes at 2000 rpm. The pellet was dried at 37°C, and resuspended in 1 ml TE 10-1 or I ml 
water. The DNA concentration was evaluated by measuring the OD at 260 nm ( 1 unit OD = 50 
Hg/ml DNA). To determine the presence of proteins in the DNA solution, the OD 260 / OD 280 
ratio was determined. Only DNA preparations having a OD 260 / OD 280 ratio between 1 .8 
15 and 2 were used in the subsequent examples described below. 

The pool was constituted by mixing equivalent quantities of DNA from each individual. 

Example 2 

Identification Of Biallelic Markers: Amplification Of Genomic DNA By PCR 

20 The amplification of specific genomic sequences of the DNA samples of Example 1 

was carried out on the pool of DNA obtained previously. In addition, 50 individual samples 
were similarly amplified. 

PCR assays were performed using the following protocol: 



Finai volume 25 |il 

25 DNA 2 ng/nl 

MgCl 2 2 mM 

dNTP (each) 200 \iM 

primer (each) 2.9 ng/^1 

Ampli Taq Gold DNA polymerase 0.05 unit/^l 

30 PCR buffer (lOx = 0.1 M TrisHCl pH8.3 0.5M KC1) lx 



Each pair of first primers was designed using the sequence information of genomic 
DNA sequences of SEQ ID Nos 1 to 78 disclosed herein and the OSP software (Hillier & 
Green, 1991). This first pair of primers was about 20 nucleotides in length and had the 
35 sequences disclosed in Table 4 in the columns labeled "Position range of amplification primer 



WO 01/40493 



PCT/TB00/01507 



154 



in SEQ ID No." and "Complementary position range of amplification primer in SEQ ID No. 3 



Table 4 



Amplicon 


SEQ 


Primer 
name 


Position range of 
amplification primer 
in SEQ ED 


Primer 
name 


Complementary 
position range of 
amplification primer 
in SEQ ID 


99-27943 


1 


Bl 


7938 


7958 


CI 


8446 


8465 


99-27935 


1 


B2 


21365 


21385 


C2 


21845 


21864 


8-128 


1 


B3 


65453 


65471 


C3 


65856 


65874 


99-31960 


1 


B4 


95034 


95053 


C4 


95543 


95563 


99-24656 


1 


B5 


107022 


107040 


C5 


107495 


107513 


99-24634 


1 


B6 


160770 


160787 


C6 


161240 


161257 


99-24639 


1 


B7 


160279 


160298 


C7 


160785 


160802 


99-7652 


1 


B8 


168813 


168830 


C8 


169331 


169351 


99-16100 


1 


B9 


170666 


170686 


C9 


171153 


171173 


99-5862 


1 


B10 


173065 


173085 


C10 


173495 


173514 


99-5919 


1 


Bll 


189753 


189771 


Cll 


190187 


190207 


99-24658 


1 


B12 


197116 


197135 


C12 


197555 


197572 


8-303 


1 


BI3 


200587 


200606 


C13 


200994 


201012 


8-300 


1 


B14 


202440 


202459 


C14 


202852 


202871 


8-299 


1 


B15 


203061 


203080 


C15 


203486 


203505 


8-296 


1 


B16 


203943 


203962 


C16 


204345 


204362 


99-24644 


1 


B17 


204741 


204758 


C17 


205222 


205240 


8-252 


1 


B18 


204373 


204392 


C18 


204777 


204794 


8-295 


1 


B19 


205034 


205052 


C19 


205435 


205453 


8-293 


1 


B20 


205768 


205786 


C20 


206175 


206193 


8-292 


1 


B21 


206316 


206333 


C21 


206725 


206742 


8-251 




B22 


207186 


207205 


C22 


207616 


207634 


8-289 




B23 


208186 


208205 


C23 


208588 


208606 


8-287 




B24 


208786 


208805 


C24 


209189 


209208 


8-285 




B25 


209520 


209539 


C25 


209931 


209949 


8-283 




B26 


210219 


210237 


C26 


210620 


210638 


8-282 




B27 


210801 


210820 


C27 


211205 


211223 


8-281 




B28 


211182 


211199 


C28 


211596 


211613 
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8-279 


1 


B29 


212290 


212309 


C29 


212698 


212716 


8-278 


1 


B30 


212662 


212679 


C30 


213091 


213109 


8-277 


1 


B31 


213783 


213803 


C31 


214062 


214082 


8-124 


1 


B32 


214564 


214581 


C32 


214965 


214983 


8-273 


1 


B33 


215310 


215328 


C33 


215711 


215729 


8-126 


1 


B34 


216687 


216704 


C34 


217082 


217101 


8-130 


1 


B35 


216683 


216702 


C35 


217091 


217109 


99-5897 


1 


B36 


219408 


219425 


C36 


219882 


219899 


99-24649 


1 


B37 


220505 


220522 


C37 


221004 


221021 


8-247 


1 


B38 


223011 


223028 


C38 


223549 


223566 


8-246 


1 


B39 


223962 


223978 


C39 


224588 


224605 


8-245 


1 


B40 


224564 


224581 


C40 


225044 


225061 


8-119 


1 


B41 


230210 


230227 


C41 


230622 


230641 


99-15870 


1 


B42 


231278 


231298 


C42 


231729 


231747 


99-16087 


1 


B43 


235081 


235101 


C43 


235515 


235533 


8-155 


1 


B44 


238607 


238626 


C44 


239029 


239046 


99-16038 


1 


B45 


239405 


239425 


C45 


239862 


239880 


8-136 


1 


B46 


239606 


239624 


C46 


240012 


240029 


99-15880 


1 


B47 


241700 


241717 


C47 


242151 


242171 


8-271 


1 


B48 


243484 


243502 


C48 


243909 


243926 


8-267 


1 


B49 


245105 


245123 


C49 


245518 


245535 


8-266 


1 


B50 


245372 


245389 


C50 


245775 


245793 


8-265 


1 


B51 


245754 


245772 


C51 


246155 


246173 


99-25940 


1 


B52 


247513 


247533 


C52 


248023 


248043 


99-16032 


1 


B53 


248204 


248223 


C53 


248588 


248606 


99-16105 


1 


B54 


255697 


255715 


C54 


256133 


256152 


99-16033 


1 


B55 


259885 


259902 


C55 


260324 


260342 


99-15875 




B56 


279626 


279644 


C56 


280154 


280173 


99-16052 




B57 


300153 


300170 


C57 


300660 


300680 


99-16047 




B58 


311615 


311632 


C58 


312126 


312144 


99-25993 




B59 


315649 


315668 


C59 


316129 


316147 
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Amplicon 


SEQ 

roNo 


Primer 
name 


Position range of 
amplification primer 
in SEQ ID 


Primer 
name 


Complementary 
position range of 
amplification primer 
in SEQ ID 


8-102 


24 


B60 


1 


20 


C60 


408 


427 


8-104 


25 


B61 


1 


21 


C61 


295 


310 


8-94 


26 


B62 


1 


18 


C62 


400 


418 


8-95 


27 


B63 


1 


18 


C63 


400 


419 


8-97 


28 


B64 


1 


18 


C64 


331 


350 


8-98 


29 


B65 


1 


19 


C65 


415 


434 


99-14021 


30 


B66 


1 


18 


C66 


465 


477 


99-14364 


31 


B67 


64 


82 


C67 


434 


454 


99-15056 


32 


B68 


1 


18 


C68 


482 


502 


99-15229 


33 


B69 


1 


20 


C69 


476 


494 


99-15232 


34 


B70 


1 


18 


C70 


467 


485 


99-15253 


35 


B71 


1 


19 


C71 


459 


477 


99-15256 


36 


B72 


1 


18 


C72 


439 


456 


99-15258 


37 


B73 


1 


19 


C73 . 


521 


541 


99-15261 


38 


B74 


1 


19 


C74 


481 


501 


99-15280 


39 


B75 


1 


18 


C75 


521 


541 


99-15355 


40 


B76 


1 


18 


C76 


471 


489 


99-15663 


41 


B77 


1 


18 


C77 


430 


450 


99-15664 


42 


B78 


1 


19 


C78 


483 


502 


99-15665 


43 


B79 


1 


20 


C79 


458 


476 


99-15668 


44 


B80 


1 


18 


C80 


439 


459 


99-15672 


45 


B8I 


1 


18 


C81 


533 


547 


99-15682 


46 


B82 


1 


19 


C82 


482 


500 


99-16081 


47 


B83 


1 


18 


C83 


443 


462 


99-16082 


48 


B84 




18 


C84 


512 


532 


99-20933 


49 


B85 




19 


C85 


432 


45 i 


99-20977 


50 


B86 




18 


C86 


491 


511 


99-20978 


51 


B87 




19 


C87 


446 


466 


99-20981 


52 


B88 




18 


C88 


429 


449 


99-20983 


53 


B89 




19 


C89 


431 


449 


99-22310 


54 


B90 




19 


C90 


446 


466 
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99-25922 


55 


B91 


1 


21 


C91 


451 


471 


99-25950 


56 


B92 


1 


19 


C92 


479 


499 


99-25961 


57 


B93 


1 


20 


C93 


464 


484 


99-25965 


58 


B94 


1 


21 


C94 


451 


470 


99-25966 


59 


B95 




21 


C95 


503 


523 


99-25967 


60 


B96 




20 


C96 


473 


493 


99-25969 


61 


B97 




21 


C97 


510 


530 


99-25972 


62 


B98 


1 


21 


C98 


428 


448 


99-25974 


63 


B99 


1 


21 


C99 


524 


544 


99-25977 


64 


B100 


1 


21 


C100 


520 


540 


99-25979 


65 


B101 


1 


19 


C101 


518 


538 


99-25985 


66 


B102 


1 


21 


C102 


449 


469 


99-25989 


67 


B103 


1 


19 


C103 


533 


553 


99-26146 


68 


B104 


1 


19 


C104 


436 


456 


99-26150 


69 


B105 


1 


19 


C105 


437 


454 


99-26153 


70 


B106 


1 


19 


C106 


429 


449 


99-26156 


71 


B107 


1 


18 


C107 


491 


511 


99-27345 


72 


B108 


1 ■ 


18 


C108 


531 


551 


99-30329 


73 


B109 


1 


18 


C109 


496 


514 


99-31798 


74 


Bl 10 


1 


20 


C1I0 


491 


511 


99-5873 


75 


Bill 


11 


28 


cm 


466 


474 


99-6012 


76 


Bl 12 


11 


29 


CI 12 


477 


490 


99-6080 


77 


Bl 13 


11 


28 


CI 13 


519 


529 


99-7308 


78 


Bl 14 


1 


18 


CI 14 


469 


489 



Preferably, the primers contained a common oligonucleotide tail upstream of the 
specific bases targeted for amplification which was useful for sequencing. 

Primers from the column labeled "Position range of amplification primer in SEQ ID 
No." contain the following additional PU 5' sequence: TGTAAAACGACGGCCAGT (SEQ ID 
No. 133); primers from the column labeled "Complementary position range of amplification 
primer in SEQ ID No." contain the following RP 5* sequence: CAGGAAACAGCTATGACC 
(SEQ ID No. 134). 

The synthesis of these primers was performed following the phosphoramidite method, on a 
GENSET UFPS 24.1 synthesizer. 
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DNA amplification was performed on a Genius II thermocycler. After heating at 95°C 
for 10 min, 40 cycles were performed. Each cycle comprised: 30 sec at 95°C S 54°C for 1 min, 
and 30 sec at 72°C. For final elongation, 1 0 min at 72°C ended the amplification. The 
quantities of the amplification products obtained were determined on 96-well microtiter plates, 
using a fluorometer and Picogreen as intercalant agent (Molecular Probes). 

Example 3 
Identification of Polymorphisms 
a) Identification of Biallelic Markers from Amplified Genomic DNA of Example 2 
The sequencing of the amplified DNA obtained in Example 2 was carried out on ABI 
377 sequencers. The sequences of the amplification products were determined using automated 
dideoxy terminator sequencing reactions with a dye terminator cycle sequencing protocol. The 
products of the sequencing reactions were run on sequencing gels and the sequences were 
determined using gel image analysis (ABI Prism DNA Sequencing Analysis software (2.1.2 
version)). 

The sequence data were further evaluated to detect the presence of biallelic markers 
within the amplified fragments. The polymorphism search was based on the presence of 
superimposed peaks in the electrophoresis pattern resulting from different bases occurring at the 
same position as described previously. 

The localization of the biallelic markers detected in the fragments of amplification are 
as shown below in Table 5. 

Table 5 
Biallelic Markers 



Amplicon 


BM 


Marker 
Name 


Polymor- 
phism 
A1I1 1 AI12 


SEQ 
ID No. 


BM 

position 
in SEQID 


Position of 
probes in 
SEQ ID No. 


Probe 
No. 


99-27943 


Al 


99-27943-150 


G 


C 




8316 


8304 


8328 


PI 


99-27935 


A2 


99-27935-193 


G 


C 




21672 


21660 


21684 


P2 


8-128 


A3 


8-128-33 


C 


T 




65485 


65473 


65497 


P3 


99-31960 


A4 


99-31960-363 


A 


G 




95396 


95384 


95408 


P4 


99-24656 


A5 


99-24656-260 


A 


G 




107281 


107269 


107293 


P5 


99-24634 


A6 


99-24639-163 


A 


C 




160640 


160628 


160652 


P6 


99-24639 


A7 


99-24634-108 


A 


T 




160876 


160864 


160888 


P7 


99-7652 


A8 


99-7652-162 


A 


G 




168974 


168962 


168986 


P8 


99-16100 


A9 


99-16100-147 


A 


G 




170810 


170798 


170822 


P9 


99-5862 


A10 


99-5862-167 


A 


G 




173358 


173346 


173370 


P10 



WO 01/40493 



159 



PCT/TB00/01507 



99-5919 


All 


99-5919-215 


A 


G 


1 


189957 


189945 


189969 


Pll 


99-24658 


A12 


99-24658-410 


A 


G 


I 


197163 


197151 


197175 


P12 


8-303 


A13 


8-303-235 


A 


G 


1 


200778 


200766 


200790 


P13 


8-300 


A14 


8-300-221 


A 


G 


1 


202651 


202639 


202663 


P14 


8-300 


A15 


8-300-193 


A 


G 


1 


202679 


202667 


202691 


P15 


8-299 


A16 


8-299-128 


A 


T 


1 


203378 


203366 


203390 


P16 


8-296 


A17 


8-296-213 


A 


T 


1 


204138 


204126 


204150 


P17 


8-252 


A18 


8-252-190 


C 


T 


1 


204605 


204593 


204617 


P18 


99-24644 


A19 


99-24644-194 


A 


G 


1 


204934 


204922 


204946 


P19 


8-295 


A20 


8-295-248 


A 


C 


1 


205206 


205194 


205218 


P20 


8-295 


A21 


8-295-125 


C 


T 


1 


205329 


205317 


205341 


P21 


8-293 


A22 


8-293-130 


A 


G 


1 


206064 


206052 


206076 


P22 


8-292 


A23 


8-292-198 


A 


G 


1 


206545 


206533 


206557 


P23 


8-251 


A24 


8-251-322 


A 


G 


1 


207313 


207301 


207325 


P24 


8-289 


A25 


8-289-322 


A 


G 


1 


208285 


208273 


208297 


P25 


8-287 


A26 


8-287-249 


C 


T 


1 


208960 


208948 


208972 


P26 


8-287 


A27 


8-287-86 


A 


T 


1 


209123 


209111 


209135 


P27 


8-285 


A28 


8-285-319 


A 


G 


1 


209631 


209619 


209643 


P28 


8-283 


A29 


8-283-278 


G 


C 


1 


210361 


210349 


210373 


P29 


8-283 


A30 


8-283-176 


A 


G 


1 


210463 


210451 


210475 


P30 


8-283 


A31 


8-283-153 


G 


C 


1 


210486 


210474 


210498 


P31 


8-283 


A32 


8-283-56 


C 


T 




210583 


210571 


210595 


P32 


8-282 


A33 


8-282-345 


G 


C 


j 


210879 


210867 


210891 


P33 


8-282 


A34 


8-282-260 


G 


T 


1 


210964 


210952 


210976 


P34 


8-282 


A3 5 


8-282-245 


A 


C 


1 


210979 


210967 


210991 


P35 


8-282 


A36 


8-282-174 


Variable 
motif 


1 


211050 






P36 


8-282 


A3 7 


8-282-92 


A 


T 


1 


211132 


211120 


211144 


P37 


8-281 


A3 8 


8-281-367 


A 


G 


1 


211247 


211235 


211259 


P38 


8-281 


A39 


8-281-299 


A 


G 




211315 


211303 


211327 


P39 


8-281 


A40 


8-281-248 


G 


C 




211366 


211354 


211378 


P40 


8-279 


A41 


8-279-197 


A 


C 




212520 


212508 


212532 


P41 


8-278 


A42 


8-278-289 < 


•> 


T 




212821 


212809 


212833 


P42 


8-277 


A43 


8-277-250 < 


—i 


T 




213833 


213821 


213845 


P43 
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8-124 



8-273 



A44 8-124-294 



A45 8-273-383 



214857 



215347 



214845 



215335 



214869 



215359 



P44 



P45 



8-130/ 
8-126 



99-5897 



99-24649 



A46 8-130-143/ 
8-126-286 



A47 199-5897-143 A 



A48 [ 99-24649-186 |A 



216967 



216955 



219540 



219528 



220836 



220824 



216979 



P46 



219552 



P47 



220848 



P48 



8-247 



A49 8-247-337 



223230 



223218 



223242 



P49 



8-246 
8-245 



A50 8-246-393 



224213 



224201 



224225 



P50 



A51 8-245-270 



224792 



224780 



224804 



P51 



8-119 



99-15870 



A52 8-1 19-247 



A53 99-15870-400 |A 



230395 



230383 



230407 



P52 



231677 



231665 



231689 



P53 



99-16087 



A54 99-16087-219 G 



235315 



235303 



235327 



P54 



8-155 
99-16038 



A55 8-155-258 



238789 



238777 



238801 



P55 



8-136 
99-15880 



A56 99-16038-1 18|C 



A57 8-136-267 



239763 



23975 1 



239775 



P56 



239763 



239751 



239775 



P57 



8-271 
8-267 



A58 99-15880-162 A 



A59 8-271-227 



241861 



241849 



241873 



P58 



243700 



243688 



243712 



P59 



A60 8-267-273 



Deletion 



245263 



P60 



8-266 



A61 8-266-131 



245663 



245651 



245675 



P6I 



8-265 



8-265 



A62 8-265-208 



A63 8-265-203 



245966 



245954 



245978 



P62 



245971 



245959 



245983 



P63 



99-25940 
99-16032 



A64 99-25940- 182TC 



247864 



247852 



247876 



P64 



A65 99-16032-292 G 



99-16105 
99-16033 



A66 99-16105-152 A 



248315 



248303 



248327 



P65 



255848 



255836 



255860 



P66 



A67 99-16033-244 |A 



260099 



260087 



260111 



P67 



99-15875 



99-16052 



99-16047 



A68 99-15875-165 |C 



A69 99-16052-214 A 



A70 99-16047-1 15 A 



279789 



279777 



279801 



P68 



300365 



300353 



300377 



P69 



312030 



12018 



12042 



P70 



99-25993 



A71 99-25993-367 A 



316014 



16002 



16026 



P71 



Amplicon 



BM 



Marker 
Name 



Polymor- 
I phism 



SEQ 

roNo. 



BM 

position 



Alll I A112 



in 



SEQID 



Position of 
probes in 
SEQ I D No. 



Probe | 
No. 



8-102 



A72 8-102-231 



24 



231 



219 



243 



P72 



8-104 



A73 8-104-105 



25 



105 



93 



117 



P73 
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A74 8-94-252 



A75 8-95-43 



26 



27 



251 



239 



263 



43 



31 



55 



P74 



P75 



8-97 
8-98 



A76 8-97-181 



28 



181 



169 



193 



A77 8-97-98 



28 



98 



86 



110 



P76 



P77 



A78 8-98-68 



29 



68 



56 



80 



P78 



99-14021 
99-14364 



A79 199-14021-108 A 



30 



108 



96 



120 



P79 



99-15056 
99-15229 



A80 99-14364-415 C 



31 



A81 99-15056-99 



32 



415 



403 



427 



99 



87 



111 



P80 



P81 



A82 99-15229-412 I A 



33 



412 



400 



424 



P82 



99-15232 



99-15253 



A83 99-15232-291 G 



34 



291 



279 



303 



A84 99-15253-382 C 



35 



382 



370 



394 



P83 



P84 



99-15256 
99-15258 



A85 99-15256-392 1 C 



36 



392 



380 



404 



P85 



A86 99-15258-337 G 



37 



337 



325 



349 



P86 



99-15261 



A87 99-15261-202 A 



38 



200 



188 



212 



P87 



99-15280 



A88 99-15280-432 I C 



39 



432 



420 



444 



P88 



99-15355 
99-15663 



A89 99-15355-150 1 C 



40 



150 



138 



162 



P89 



A90 99-15663-298 C 



41 



298 



286 



310 



P90 



99-15664 
99-15665 



A91 99-15664-185 G 



42 



185 



173 



197 



P91' 



A92 99-15665-398|A 



43 



398 



386 



410 



P92 



99-15668 



A93 99-15668-139 C 



44 



139 



127 



151 



P93 



99-15672 



A94 99-15672-166 C 



45 



166 



154 



178 



P94 



99-15682 
99-16081 



A95 99-15682-318 A 



46 



318 



306 



330 



P95 



A96 99-16081-217 \C 



47 



217 



205 



229 



P96 



99-16082 
99-20933 



A97 99-16082-218 I A 



48 



218 



206 



230 



P97 



A98 99-20933-81 



49 



81 



69 



93 



P98 



99-20977 



99-20978 



A99 99-20977-72 



50 



71 



59 



83 



A 100 99-20978-89 



51 



89 



77 



101 



P99 



PI 00 



99-20981 



A101 99-20981-300 A 



52 



300 



288 



312 



P101 



99-20983 



A 102 99-20983-48 



53 



48 



36 



60 



P102 



99-22310 



A 103 99-223 10-148TC 



54 



148 



136 



160 P103 



99-25922 



A 104 99-25922-147 G 



55 



147 



135 



159 P104 



99-25950 



A 105 99-25950-121 I G 



56 



121 



109 



133 P105 



99-25961 



A106 99-25961-376TA 



57 



373 



361 



385 



P106 



99-25965 



A107 99-25965-399IA 



58 



399 



387 



411 



P107 



WO 01/40493 



PCT/IB00/01507 



162 



99-25966 


A108 


99-25966-241 


A 


G 


59 


241 


229 


253 


PI 08 


99-25967 


A109 


99-25967-57 


A 


G 


60 


56 


44 


68 


PI 09 


99-25969 


A110 


99-25969-200 


G 


T 


61 


200 


188 


212 


PI 10 


99-25972 


Alll 


99-25972-317 


C 


T 


62 


315 


303 


327 


Pill 


99-25974 


Al 12 


99-25974-143 


A 


G 


63 


143 


131 


155 


PI 12 


99-25977 


Al 13 


99-25977-311 


A 


G 


64 


311 


299 


323 


PI 13 


99-25979 


A114 


99-25979-93 


A 


G 


65 


93 


81 


105 


PI 14 


99-25985 


Al 15 


99-25985-194 


C 


T 


66 


194 


182 


206 


PI 15 


99-25989 


Al 16 


99-25989-398 


A 


G 


67 


398 


386 


410 


PI 16 


99-26146 


A117 


99-26146-264 


G 


T 


68 


264 


252 


276 


PI 17 


99-26150 


Al 18 


99-26150-276 


A 


G 


69 


276 


264 


288 


PI 18 


99-26153 


Al 19 


99-26153-44 


A 


C 


70 


44 


32 


56 


P119 


99-26156 


A120 


99-26156-290 


A 


C 


71 


290 


278 


302 


P120 


99-27345 


A121 


99-27345-189 


G 


C 


72 


189 


177 


201 


P121 


99-30329 


A122 


99-30329-380 


C 


T 


73 


380 


368 


392 


P122 


99-31798 


A 123 


99-31798-344 


Variable 
motif 


74 


342 






P123 


99-5873 


A124 


99-5873-159 


C 


T 


75 


159 


147 


171 


PI 24 


99-6012 


A125 


99-6012-220 


G 


T 


76 


220 


208 


232 


P125 


99-6080 


A126 


99-6080-99 


C 


T 


77 


99 


87 


111 


P126 


99-7308 


A127 


99-7308-157 


C 


T 


78 


156 


144 


168 


P127 



BM refers to "biallelic marker". Alll and alI2 refer respectively to allele I and allele 2 
of the biallelic marker. 

Certain biallelic markers of the invention are insertions or deletions, as indicated above. 
In particular, the biallelic marker 8-282-174 comprises a variable motif AAAGG or 
GAAGGAAGGAAGGAAGGAAGA. The biallelic marker 8-267-273 comprises an insertion 
or deletion of GTGGGC. The biallelic marker 99-3 1798-344 comprises a variable motif CAT 
or ACTGTCTCTACCTCA 

Example 4 

Validation Of The Polymorphisms Through Microsequencing 

The biallelic markers identified in Example 3 were further confirmed and their 
respective frequencies were determined through microsequencing. Microsequencing was 
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carried out for each individual DNA sample described in Example 1. 

Amplification from genomic DNA of individuals was performed by PCR as described 
above for the detection of the biailelic markers with the same set of PCR primers (Table 3). 

The preferred primers used in microsequencing were about 19 nucleotides in length and 
hybridized just upstream of the considered polymorphic base. According to the invention, the 
primers used in microsequencing are detailed in Table 6. 



Table 6 



Marker Name 


Biailelic 
Marker 

a* ami nwi 


SEQ 
ID No. 


Mis. 1 


Position range of 
microsequencing 
primer mis. 1 in 
SEQ ID No. 


Mis. 2 


Complementary 
position range of 
microsequencing 
primer mis. 2 in 
SEQ ID No. 


99-27943-150 


Al 


1 


Dl 


8297 


8315 


El 


8317 


8335 


99-27935-193 


A2 


1 


D2 


21653 


21671 


E2 


21673 


21691 


8-128-33 


A3 


1 


D3 


65466 


£ C A a A 

65484 


E3 


65486 


65504 


99-31960-363 


A4 


1 


D4 


95377 


95395 


E4 


95397 


95415 


99-24656-260 


A5 


1 


D5 


1 07262 


107280 


E5 


107282 


107300 


99-24639-163 


A6 


1 


D6 


1 6062 1 


160639 


E6 


160641 


160659 


99-24634-108 


A7 


1 


D7 


160857 


160875 


E7 


1 60877 


160895 


99-7652-162 


A8 


1 


D8 


168955 


168973 


E8 


168975 


168993 


99-16100-147 


A9 


1 


D9 


1 /U791 


1 1 f\o f\C\ 

170809 


E9 


170811 


170829 


99-5862-167 


A10 


1 


D10 


1 / JJJ7 


i /jjj / 


E10 


1 Til CO 


1 73377 


99-5919-215 


All 


1 


Dll 


189938 


189956 


Ell 


189958 


189976 


99-24658-410 


A12 


1 


D12 


197144 


197162 


E12 


197164 


197182 


8-303-235 


A13 


1 


D13 


200759 


200777 


E13 


200779 


200797 


8-300-221 


A14 




D14 


202632 


202650 


E14 


202652 


202670 


8-300-193 


A15 




D15 


202660 


202678 


E15 


202680 


202698 


8-299-128 


A16 




D16 


203359 


203377 


E16 


203379 


203397 


8-296-213 


A17 




D17 


204119 


204137 


E17 


204139 


204157 


8-252-190 


A18 




D18 


204586 


204604 


E18 


204606 


204624 


99-24644-194 


A19 




D19 


204915 


204933 


E19 


204935 


204953 


8-295-248 


A20 




D20 


205187 


205205 


E20 


205207 


205225 


8-295-125 


A21 




D21 


205310 


205328 


E21 


205330 


205348 


8-293-130 


A22 




D22 


206045 


206063 


E22 


206065 


206083 


8-292-198 


A23 




D23 


206526 


206544 


E23 


206546 


206564 


8-251-322 


A24 




D24 


207294 


207312 


E24 


207314 


207332 
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8-289-322 


A25 


1 


D25 


208266 


208284 


E25 


208286 


208304 


8-287-249 


A26 




D26 


208941 


208959 


E26 


208961 


208979 


8-287-86 


A27 


1 


D27 


209104 


209122 


E27 


209124 


209142 


8-285-319 


A28 


1 


D28 


209612 


209630 


E28 


209632 


209650 


8-283-278 


A29 


1 


D29 


210342 


210360 


E29 


210362 


210380 


8-283-176 


A30 


1 


D30 


210444 


210462 


E30 


210464 


210482 


8-283-153 


A31 


1 


D31 


210467 


210485 


E31 


210487 


210505 


8-283-56 


A32 


1 


D32 


210564 


210582 


E32 


210584 


210602 


8-282-345 


A33 


1 


D33 


210860 


210878 


E33 


210880 


210898 


8-282-260 


A34 


1 


D34 


210945 


210963 


E34 


210965 


210983 


8-282-245 


A35 


1 


D35 


210960 


210978 


E35 


210980 


210998 


8-282-174 


A36 


1 


D36 






E36 






8-282-92 


A37 


1 


D37 


211113 


211131 


E37 


211133 


211151 


8-281-367 


A38 


1 


D38 


211228 


211246 


E38 


211248 


211266 


8-281-299 


A39 


1 


D39 


211296 


211314 


E39 


211316 


211334 


8-281-248 


A40 


1 


D40 


211347 


211365 


E40 


211367 


211385 


8-279-197 


A41 


1 


D41 


212501 


212519 


E41 


212521 


212539 


8-278-289 


A42 


1 


D42 


212802 


212820 


E42 


212822 


212840 


8-277-250 


A43 




D43 


213814 


213832 


E43 


213834 


213852 


8-124-294 


A44 


1 


D44 


214838 


214856 


E44 


214858 


214876 


8-273-383 


A45 


1 


D45 


215328 


215346 


E45 


215348 


215366 


8-130-143 / 
8-126-286 


A46 


1 


D46 


216948 


216966 


E46 


216968 


216986 


99-5897-143 


A47 


1 


D47 


219521 


219539 


E47 


219541 


219559 


99-24649-186 


A48 


1 


D48 


220817 


220835 


E48 


220837 


220855 


8-247-337 


A49 


1 


D49 


223211 


223229 


E49 


223231 


223249 


8-246-393 


A50 


1 


D50 


224194 


224212 


E50 


224214 


224232 


8-245-270 


A51 


1 


D51 


224773 


224791 


E51 


224793 


224811 


8-119-247 


A52 


1 


D52 


230376 


230394 


E52 


230396 


230414 


99-15870-400 


A53 




D53 


231658 


231676 


E53 


231678 


231696 


99-16087-219 


A54 




D54 : 


235296 


235314 


E54 


235316 


235334 


8-155-258 


A55 




D55 : 


238770 : 


238788 


E55 


238790 


238808 


99-16038-118 


A56 




D56 : 


239744 : 


239762 


E56 : 


239764 . 


239782 


8-136-267 


A57 




D57 : 


23 9744 : 


239762 


E57 : 


239764 : 


239782 
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99-15880-162 


A58 


1 


D58 


241842 


241860 


E58 


241862 


241880 


8-271-227 


A59 


1 


D59 


243681 


243699 


E59 


243701 


243719 


8-267-273 


A60 


I 


D60 






E60 






8-266-131 


A61 


1 


D61 


245644 


245662 


E61 


245664 


245682 


8-265-208 


A62 


1 


D62 


245947 


245965 


E62 


245967 


245985 


8-265-203 


A63 


1 


D63 


245952 


245970 


E63 


245972 


245990 


99-25940-182 


A64 


1 


D64 


247845 


247863 


E64 


247865 


247883 


99-16032-292 


A65 


1 


D65 


248296 


248314 


E65 


248316 


248334 


99-16105-152 


A66 


1 


D66 


255829 


255847 


E66 


255849 


255867 


99-16033-244 


A67 


1 


D67 


260080 


260098 


E67 


260100 


260118 


99-15875-165 


A68 




D68 


279770 


279788 


E68 


279790 


279808 


99-16052-214 


A69 


' 


D69 


300346 


300364 


E69 


300366 


300384 


99-16047-115 


A70 




D70 


312011 


312029 


E70 


312031 


312049 


99-25993-367 


A71 


1 


D71 


315995 


316013 


E71 


316015 


316033 


Marker Name 


Biallelic 
Marker 


SEQ 

ill INO* 


Mis. 1 


Position range of 
microsequencing 
primer mis. 1 in 
SEQ ID No. 


Mis. 2 


Complem 
position r 
microseqi 
primer m 

OKAl ID 1> 


entary 
angeof 
lencing 
is. 2 in 

IO. 


8-102-231 


A72 


24 


D72 


212 


230 


E72 


232 


250 


8-104-105 


A73 


25 


D73 


86 


104 


E73 


106 


124 


8-94-252 


A74 


26 


D74 


232 


250 


E74 


252 


270 


8-95-43 


A75 


27 


D75 


24 


42 


E75 


44 


62 


8-97-181 


A76 


28 


D76 


162 


180 


E76 


182 


200 


8-97-98 


Ml 


28 


D77 


79 


97 


E77 


99 


117 


8-98-68 


A78 


29 


D78 


49 


67 


E78 


69 


87 


99-14021-108 


A79 


30 


D79 


89 


107 


E79 


109 


127 


99-14364-415 


A80 


31 


D80 


396 


414 


E80 


416 


434 


99-15056-99 


A81 


32 


D81 


80 


98 


E81 


100 


118 


99-15229-412 


A82 


33 


D82 


393 


411 


E82 


413 


431 


99-15232-291 


A83 


34 


D83 


272 


290 


E83 


292 


310 


99-15253-382 


A84 


35 


D84 


363 


381 


E84 


383 


401 


99-15256-392 


A85 


36 


D85 


373 


391 


E85 


393 


411 


99-15258-337 


A86 


37 


D86 


318 


336 


E86 


338 


356 


99-15261-202 


A87 


38 


D87 


181 


199 


E87 


201 


219 
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99-15280-432 


A88 


39 


D88 


413 


431 


E88 


433 


451 


99-15355-150 


A89 


40 


D89 


131 


149 


E89 


151 


169 


99-15663-298 


A90 


41 


D90 


279 


297 


E90 


299 


317 


99-15664-185 


A91 


42 


D9I 


166 


184 


E91 


186 


204 


99-15665-398 


A92 


43 


D92 


379 


397 


E92 


399 


417 


99-15668-139 


A93 


44 


D93 


120 


138 


E93 


140 


158 


99-15672-166 


A94 


45 


D94 


147 


165 


E94 


167 


185 


99-15682-318 


A95 


46 


D95 


299 


317 


E95 


319 


337 


99-16081-217 


A96 


47 


D96 


19S 


216 


E96 


218 


236 


99-16082-218 


A97 


48 


D97 


199 


217 


E97 


219 


237 


99-20933-81 


A98 


49 


D98 


62 


80 


E98 


82 


100 


99-20977-72 


A99 


50 


D99 


52 


70 


E99 


72 


90 


99-20978-89 


A100 


51 


D100 


70 


88 


El 00 


90 


108 


99-20981-300 


A101 


52 


D101 


281 


299 


E101 


301 


319 


99-20983-48 


A102 


53 


D102 


29 


47 


El 02 


49 


67 


99-22310-148 


A103 


54 


D103 


129 


147 


El 03 


149 


167 


99-25922-147 


A104 


55 


D104 


128 


146 


El 04 


148 


166 


99-25950-121 


A105 


56 


D105 


102 


120 


E105 


122 


140 


99-25961-376 


A106 


57 


D106 


354 


372 


E106 


374 


392 


99-25965-399 


A107 


58 


D107 


380 


398 


E107 


400 


418 


99-25966-241 


A108 


59 


DI08 


222 


240 


E108 


242 


260 


99-25967-57 


A109 


60 


D109 


37 


55 


E109 


57 


75 


99-25969-200 


A110 


61 


DUO 


181 


199 


£1 10 


201 


219 


99-25972-317 


Alll 


62 


Dill 


296 


314 


HI 1 1 


316 


334 


99-25974-143 


A112 


63 


D112 


124 


142 


El 12 


144 


162 


99-25977-311 


A113 


64 


Dl 13 


292 


310 


E113 


312 


330 


99-25979-93 


A114 


65 


Dl 14 


74 


92 


El 14 


94 


112 


99-25985-194 


A115 


66 


Dl 15 


175 


193 


El 15 


195 


213 


99-25989-398 


Al 16 


67 


Dl 16 


379 


397 


El 16 


399 


417 


99-26146-264 


A117 


68 


Dl 17 


245 


263 


El 17 


265 


283 


99-26150-276 


Al 18 


69 


Dl 1 8 


257 


275 


El 1 8 


277 


295 


99-26153-44 


A119 


70 


Dl 19 


25 


43 


E119 


45 


63 


99-26156-290 


A120 


71 


D120 


271 


289 


E120 


291 


309 


99-27345-189 


A121 


72 


DI2I 


170 


188 


E121 


190 


208 
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99-30329-380 


A122 


73 


D122 


361 


379 


E122 


381 


399 


99-31798-344 


A123 


74 


D123 






E123 






99-5873-159 


A124 


75 


D124 


140 


158 


E124 


160 


178 


99-6012-220 


AI25 


76 


D125 


201 


219 


E125 


221 


239 


99-6080-99 


A126 


77 


D126 


80 


98 


E126 


100 


118 


99-7308-157 


A127 


78 


D127 


137 


155 


E127 


157 


175 



Mis 1 and Mis 2 respectively refer to microsequencing primers which hybridized with 
the coding strand or with the non-coding strand of the nuceotide sequences of the invention. 
The microsequencing reaction was performed as follows : 

After purification of the amplification products, the microsequencing reaction mixture 
was prepared by adding, in a 20^1 final volume: 10 pmoi microsequencing oligonucleotide, 1 U 
Thermosequenase (Amersham E79000G), 1 .25 nl Thermosequenase buffer (260 mM Tris HC1 
pH 9.5, 65 mM MgCI 2 ), and the two appropriate fluorescent ddNTPs (Perkin Elmer, Dye 
Terminator Set 401095) complementary to the nucleotides at the polymorphic site of each 
biallelic marker tested, following the manufacturer's recommendations. After 4 minutes at 
94°C, 20 PCR cycles of 15 sec at 55°C, 5 sec at 72°C, and 10 sec at 94°C were carried out in a 
Tetrad PTC-225 thermocycler (MJ Research). The unincorporated dye terminators were then 
removed by ethanol precipitation. Samples were finally resuspended in formamide-EDTA 
loading buffer and heated for 2 min at 95°C before being loaded on a polyacrylamide 
sequencing gel. The data were collected by an ABI PRISM 377 DNA sequencer and processed 
using the GENESCAN software (Perkin Elmer). 

Following gel analysis, data were automatically processed with software that allows the 
determination of the alleles of biallelic markers present in each amplified fragment. 

The software evaluates such factors as whether the intensities of the signals resulting 
from the above microsequencing procedures are weak, normal, or saturated, or whether the 
signals are ambiguous. In addition, the software identifies significant peaks (according to shape 
and height criteria). Among the significant peaks, peaks corresponding to the targeted site are 
identified based on their position. When two significant peaks are detected for the same 
position, each sample is categorized classification as homozygous or heterozygous type based 
on the height ratio. 



Example 5a 

Association Study Between Schizophrenia And The Biallelic Markers Of The Invention 
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Collection Of DNA Samples From Affected And Non- Affected Individuals 
A) Affected population 

All the samples were collected from a large epidemiological study of schizophrenia 
undertaken in hospital centers of Quebec from October 1995 to April 1997. The population was 
composed of French Caucasian individuals. The study design consisted in the ascertainment of 
cases and two of their first degree relatives (parents or siblings). 

As a whole, 956 schizophrenic cases were ascertained according to the following 
inclusion criteria: 

- the diagnosis had been done by a psychiatrist; 

- the diagnosis had been done at least 3 years before recruitment time, in order to 
exclude individuals suffering from transient manic-depressive psychosis or depressive 
disorders; 

- the patient ancestors had been living in Quebec for at least 6 generations; 

- it was possible to get a blood sample from 2 close relatives. 

Among the 956 schizophrenic ascertained cases, 834 individuals were included in the 
study for the following reasons: 

- for the included individual cases, the diagnosis of schizophrenia was established 
according to the DSM-IV (Diagnostic and Statistical Manual, Fourth edition, Revised 1994, 
American Psychiatric Press); 

- samples from individuals suffering from schizoaffective disorder were discarded; 

- individuals suffering from catatonic schizophrenia were also excluded from the 
population of schizophrenic cases; 

- were also excluded the individuals having a first degree relative or 2 or more second 
degree relatives suffering from depression or mood disorder; 

- individuals having had severe head trauma, severe obstretical complications, 
encephalitis, or meningitis before onset of symptoms were also excluded; 

- has also been excluded from the population of schizophrenic cases a patient suffering 
from epilepsy and treated with anticonvulsants. 

The age at onset was not added as an inclusion criteria. 
B) Unaffected population 

Control cases were respectively ascertained based on the following cumulative criteria: 

- the individual must not be affected by schizophrenia or any other psychiatric disorder, 

- the individual must have 35 years old or more; 

- the individual must belong to the French-Canadian population; 

- the individual must have one or two first degree relative available for blood sampling. 
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Controls were matched with cases sex when possible. 
C) Cases and Control Populations Selected for the Association Study 
The unaffected population retained for the study was composed of 241 individuals. The 
initial sample of the clinical study was composed of 215 cases and 214 controls. The controls 
were composed of 1 16 males and 98 females while the cases were composed of 154 males and 
64 females. For each control, two first degree relatives (father, mother, sisters and brothers) 
were available. In order to match the sex of cases and controls, the parents of female controls 
were substituted for the female controls where possible and where the parents were known to be 
unaffected by schizophrenia or other psychosis. The parents of 27 female controls were thus 
substituted for the respective females, resulting in a total control sample size of 241 individuals. 
The composition of the control sample is detailed below in Table 7. 

Table 7 



Description of control samples 



Probands 


187 


Male 


116 


Female 


71 


Parents of probands 


54 


Fathers 


27 


Mothers 


27 


Total 


241 



The association data that are presented below were obtained on a population size 
detailed in Table 8 below, wherein the individuals have been randomly selected from the 
populations detailed above. 



Table 8 



Cases and Control Populations Selected for the Association Study 






sample type 


Cases 


Controls 


sample size 




215 


241 


Gender 










Male 


151 


143 




Female 


64 


98 


Familial history of psychosis (FH)* 








positive (FH+) 


82 


0 




none (FH-) 


133 


241 
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* : close relatives (first or second degree) 



Both case and control populations form two groups, each group consisting of unrelated 
individuals that do not share a known common ancestor. Additionally, the individuals of the 
control population were selected among those having no family history of schizophrenia or 
schizophrenic disorder. 

Genotyping of affected and control individuals 

A) Results from the genotvoing 

The general strategy to perform the association studies was to individually scan the 
DNA samples from all individuals in each of the populations described above in order to 
establish the allele frequencies of biallelic markers, and among them the biallelic markers of the 
invention, in the diploid genome of the tested individuals belonging to each of these 
populations. 

Allelic frequencies of every biallelic marker in each population (cases and controls) 
were determined by performing microsequencing reactions on amplified fragments obtained by 
genomic PCR performed on the DNA samples from each individual. Genomic PCR and 
microsequencing were performed as detailed above in Examples 1 to 4 using the described PCR 
and microsequencing primers. 

Single biallelic marker frequency analysis 

For each allele of the biallelic markers included in this study, the difference between the 
allelic frequency in the unaffected population and in the population affected by schizophrenia 
was calculated and the absolute value of the difference was determined. The more the 
difference in allelic frequency for a particular biallelic marker or a particular set of biallelic 
markers, the more probable an association between the genomic region harboring this particular 
biallelic marker or set of biallelic markers and schizophrenia. Allelic frequencies were also 
useful to check that the markers used in the haplotype studies meet the Hardy- Weinberg 
proportions (random mating). 

The allelic frequencies of biallelic markers in the chromosome 13q31-q33 region 
between the affected and the unaffected population, using the sample population described 
above, is set forth in Table 9. 



Table 9 

Allelic frequencies of markers in different sub-samples 



marker 


alleles 


all sample 






cases 


controls 






all 


HF+ 


HF- 
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99-20978/89 


C/G 


0.51 


0 47 


ft < i 


A << 
U.JJ 


99-20983/48 


A/G 


0 30 


0 98 


ft TJ 

U.3J 


A lO 

0.29 


99-20981/300 


A/G 


0.54 


0 <, 1 
U.J 1 


U.jj 


A C£ 

U.jO 


99-20977/72 


A/C 


0.40 


ft 4 1 


ft 18 

U.3o 


A 1 C 

0.35 


99-6080/99 


C/T 


0 58 

U.JO 


0 S7 
U.J / 


ft <"7 

U.j / 


A CC 

0.55 


99-15229/412 


A/n 


0 S4 

V. J*f 


ft co 


A CC 

0.55 


0.53 


99-22110/148 


P/T 


0 46 


A H Q 


Pi A A 

0.44 


0.47 


99-15232/291 

77 1 J t*J LI 1 


r/T 


0 46 
U.40 


ft y!8 
U.4S 


0,43 


0.47 


99-14021/108 


A/n 


0 46 


ft /IB 

U.4o 


A AA 

U.44 


0.47 


8-98/68 


A/n 

r\j\J 


ft "Jfl 


A 1 8 


A T3 

0.23 


0.19 


8-97/98 


err 
\j 1 


ft 78 


A *TC 

0.75 


0.81 


0.80 


99-6019/990 


w 1 


ft "/A 
U.2U 


A 1 A 

U.19 


0.23 


0.19 


8-9 V41 

O 7 J/*+ J 


A/vJ 


ft 1 8 


a on 
0.20 


0.18 


0.21 


99-71051/ 1 S7 

77" / JUO/ i j / 


p/t 


ft 1Q 


A A O 

0.42 


0.36 


0.39 


00. 147 Ad Ml 
77- 1 *t jO*»/ *f i j 


w 1 


0.38 


0.40 


0.36 


0.39 


99-1 ^679/166 


err 


A < 1 


A A "7 

0.47 


0.54 


0.54 


QO-1 <«66ft/1 30 
77- 1 3000/ 1 jy 


U/ 1 


A CD 


0.56 


0.62 


0.65 


77" 1 J00j/J70 


A/U 


A T"> 

0.72 


0.67 


0.72 


0.76 


77- 1 joo j/zv o 


err 
C/ 1 


0.72 


0.67 


0.72 


0.76 


90-1 S664/1 R<< 

1 JOOf / I Q J 


err 
U/ 1 


A iCA 

u.oy 


0.62 


0.72 


0.72 


OO 1 ^689/7 1 8 

77- 1 JOO// J 1 O 


a rr 
A/ 1 


A If 

0.35 


0.40 


0.34 


0.32 


99-90971/R1 

77-iU7JJ/ O 1 


a le 


A A1 

U.4J 


A A 1 

0.41 


0.42 


0.40 


00. 1 6081 /91 7 

77 1 QUO 1 1 / 




A A1 


0.38 


0.46 


0.39 


99-1 6089/91 R 

77 1 DUOZ/Z 1 o 


a /n 
A/w 


0.33 


A Tf 1 

0.31 


0.35 


0.32 


99-S869/1 67 


V-/ l 


A A1 

U.4 / 


A A Tt 

0.43 


0.44 


0.51 


99-16100/147 


a /n 
A/O 


A AQ 


0.44 


0.45 


0.50 


QQ.76S9/169 

7 7" /O JZ/ I OZ 


a in 


A /1Q 

0.49 


0.46 


0.46 


0.52 


OO 1 Q/9 1 


A/O 


0.66 


0.71 


0.69 


0.60 


00 ^807/143 


A/U 


A CO 
U.J 5 


0.61 


0.53 


0.59 


99-1 ^870/400 

77* 1 JO /U/HUU 


A/O 


a n 


0.38 


0.27 


0.33 


99-1 6019/999 


a le 


A XI 
U.O 1 


0.62 


0.64 


0.58 


99-15880/162 


A/G 


0.62 


0.63 


0.65 


0 

V.JO 


99-16038/118 


A/G 


0.38 


0.36 


0.35 


0.42 


99-15875/165 


C/T 


0.58 


0.57 


0.57 


0.63 


99-16033/244 


C/T 


0.55 


0.57 


0.49 


0.54 


99-16047/115 


C/T 


0.73 


0.75 


0.68 


0.73 



In the association study described herein, several individual biallelic markers were 
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shown to be significantly associated with schizophrenia. In particular, several of the 
chromosome 13q31-q33 region biallelic markers (99-16038/1 18 (A56), 99-7652/162 (A8), 99- 
5919/215 (Al 1), 99-15875/165 (A68), 99-16032/292 (A9)) showed significant association with 
schizophrenia in both familial and sporadic schizophrenia cases. The significance of the 
absolute value of the difference of allelic frequency of the individual biallelic markers in the 
affected and the unaffected population is set forth in 2, with several biallelic marker having 
allelic frequency differences with p-values approaching or less than 0.05, biallelic marker 99- 
5919/215 (A 1 1) having a p-value of less than 0.01. Figure 2 also shows the physical order of 
certain specific biallelic markers. These results show that several biallelic markers individually 
associated with schizophrenia are physically located in a particular region of significance, the 
subregion of the chromosome 13q3 l-q33 region referred to herein as Region D. 
Haplotype frequency analysis 

Analysis of markers Haplotype analysis for association of chromosome 13q31-q33- 
related biallelic markers and schizophrenia was performed by estimating the frequencies of all 
possible 2, 3 and 4 marker haplotypes in the affected and control populations described above. 
Haplotype estimations were performed by applying the Expectation-Maximization (EM) 
algorithm (Excoffier and Slatkin, 1995), using the EM-HAPLO program (Hawley et al., 1994) 
as described above. Estimated haplotype frequencies in the affected and control population 
were compared by means of a chi-square statistical test (one degree of freedom). 

Haplotype association results in schizophrenia cases 

The results of the haplotype analysis using the chromosome I3q31-q33-related biallelic 
markers biallelic markers is shown in Figure 3. In particular, the figures show the most 
significant haplotypes using the biallelic markers: 99-16047/1 15 (A 10), 99-16033/244 (A67), 
99-16038/1 18 (A56), 99-15875/165 (A68), 99-16032/292 (A9), 99-5897/143 (A47), 99- 
15880/162 (A58), 99-16082/218 (A97), 99-5919/215 (Al 1), 99-7652/162 (A8), 99-16100/147 
(A9), 99-5862/167 (AI0). 

A number of biallelic marker haplotypes were shown to be significantly associated with 

schizophrenia. A first preferred haplotype (HAP287 of Figure 3) consisting of four biallelic 

markers (99-16038/1 18 (A56), 99-16082/218 (A97), (99-7652/162 (A8) and 99-16100/147 

(A9)) is highly significantly associated with schizophrenia in both total cases and sporadic 

cases. Figure 4 shows the characteristics of this haplotype. This haplotype presented a p-value 

-7 6 
of 3 . 1 x 1 0 and an odd-ratio of 4.0 1 for total cases and a p-value of 3 .9x 1 0 and an odd-ratio 

of 3.88 for sporadic cases. Phenotypic permutation tests confirmed the statistical significance 

of these results. Estimated haplotype frequencies were 13.8% in total cases, 13.5% in the 

sporadic cases, and 3.8 % in the controls. 



WO 01/40493 PCT/IBOO/01507 

173 

Several other significant haplotypes are listed in Figure 3, including several 2-, 3- and 
4-marker haplotypes. Considered to be highly significantly associated with schizophrenia are 
the most significant 2-marker haplotype (HAP1 consisting of biallelic markers 99-15875/165 
(A68) and 99-5919/215 (Al 1)) and the most significant 3-marker haplotype (HAP67 consisting 
of biallelic markers 99-1 6038/1 18 (A56), 99-1 6082/2 1 8 (A97) and 99-7652/162 (A8)). 

Further preferrred significant haplotypes considered associated with schizophrenia are 
haplotypes having p-values above a desired threshold level are also; all the haplotypes listed in 
Figure 3 present p-values below 1 .OxlO' 2 for 2-marker haplotypes, 1 .OxlO" 4 for 3-marker 
haplotypes, and 1 .0x1 0' 5 for 4-marker haplotypes. All of the biallelic markers presented in 
Figure 4 except for 1 (99-16047/1 15 (A70)) are involved in haplotypes having a p-value above 
these threshold levels. Figure 3 shows several 2-marker haplotypes, HAP1 to HAP8, having p- 
values ranging from 1 .OxlO" 2 to 1 .2xI0* 3 , several 3-marker haplotypes, HAP67 to HAP76, 
having p-values ranging from 1 .3x 1 0' 5 to 1 .0x1 0"* and several 4-marker haplotypes, HAP287 to 
HAP291, having p-values ranging from 8.2xl0" 7 to 3.1xl0" 7 . Figure 4 shows biallelic markers 
involved in significant haplotypes having significance thresholds of 1. OxlO" 2 , 1.0x10"*, and 
1 .OxlO" 5 for 2-, 3- and 4-marker haplotypes, respectively. 

Several 2-, 3- and 4-marker haplotypes, HAP 1, HAP8, HAP70, HAP71, HAP75, 
HAP76, HAP288, HAP290 and HAP291, often comprised the biallelic marker 99-5919/215 
(Al 1) allele A. Furthermore, several 2-, 3- and 4-marker haplotypes, HAP7, HAP67, HAP69, 
HAP75, HAP287 AND HAP288, often comprised the biallelic marker 99-16038/1 18 (A56) 
allele G. 

Example 5b 

Association Study Between Schizophrenia And The Biallelic Markers Of The Invention 

Collection Of DNA Samples From Affected And Non-Affected Individuals 

Biallelic markers of the invention were further analyzed in the French Canadian 
population described above. For this analysis, the proband case population under study 
consisted of 139 individuals, the control population consisted of 141 individuals, as described in 
Table 10 below. 



Table 10 



Cases and Control Populations Selected for the Association 
Study 






Sample type 


Cases 


Controls 


Sample size 




139 


141 


Gender 










Male 


94 


96 
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Female 


45 


45 


Familial history of psychosis (FH)* 








positive (FH+) 


76 


0 




none (FH-) 


63 


141 


* : close relatives (first or second degree) 







Genotyping of affected and control individuals 
A) Results from the 2enotyping 

The general strategy for performing the association studies was to individually scan the 
5 DNA samples from all individuals in each of the populations described above in order to 

establish the allele frequencies of biallelic markers, and among them the biallelic markers of the 
invention, in the diploid genome of the tested individuals belonging to each of these 
populations. 

Allelic frequencies of every biallelic marker in each population (cases and controls) 
10 were determined by performing microsequencing reactions on amplified fragments obtained by 

genomic PCR performed on the DNA samples from each individual. Genomic PCR and 
microsequencing were performed as detailed above in Examples 1 to 3 using the described PCR 
and microsequencing primers. 

Single biallelic marker frequency analysis 
1 5 For each allele of the biallelic markers included in this study, the difference between the 

allelic frequency in the unaffected population and in the population affected by schizophrenia 
was calculated and the absolute value of the difference was determined. The allelic frequencies 
of between the affected and the unaffected population in the regions is set forth in Table 1 1, 
using the sample population described above and in Table 10. The more the difference in allelic 
20 frequency for a particular biallelic marker or a particular set of biallelic markers, the more 

probable an association between the genomic region harboring this particular biallelic marker or 
set of biallelic markers and schizophrenia. Allelic frequencies were also useful to check that the 
markers used in the haplotype studies meet the Hardy- Weinberg proportions (random mating). 

Table 11 

25 Allelic frequencies of markers in differents sub-samples (%) 



Marker 


polymorphism 


Cases 


All controls 


All cases 


HF+ 


HF- 


99-20978/89 


C/G 


50,37 


47,26 


54,03 


55,43 


99-20983/48 


AJG 


30,37 


28,67 


32,5 


26,52 
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99-20977/72 


A/C 


41,01 


42,11 


39,68 


34,4 


99-20981/300 


A/G 


52,17 


51,33 


53,17 


60 


99-6080/99 


C/T 


58,82 


58 


59,84 


54,85 


99-15229/412 


A/G 


54,92 


52,86 


57,26 


51,88 


99-22310/148 


C/T 


44,2 


46,71 


41,13 


48,57 


99-15232/291 


G/T 


43,85 


46,43 


40,83 


49,28 


99-14021/108 


A/G 


44,85 


47,26 


42,06 


48,54 


8-94/252 


A/G 


2,22 


1,97 


2,54 


2 52 


8-98/68 


A/G 


19,06 


17,76 


20,63 


19,06 


8-97/98 


C/T 


76,26 


74,34 


78,57 


77,3 


99-6012/220 


G/T 


20 


18,49 


21,77 


18,79 


99-7308/157 


C/T 


40,31 


41,89 


38,18 


39,36 


99-14364/415 


C/T 


39,93 


40,79 


38,89 


40 


8-95/43 


A/G 


20,29 


20,39 


20,16 


22,14 


99-15672/166 


C/T 


49,28 


47,37 


51,59 


56,74 


99-15668/139 


C/T 


58,21 


56,16 


60,66 


66,67 


99-15665/398 


A/G 


70,5 


67,76 


73,81 


76,79 


99-15663/298 


C/T 


70 5 


67,76 


73,81 


76,95 


99-15664/185 


G/T 


66,54 


62,33 


71,43 


72 5 


99-15682/318 


A/T 


35,27 


39,58 


29,82 


32,66 


99-20933/81 


A/C 


43,12 


42,76 


43,55 


42 45 


99-26146/264 


G/T 


39,62 


38,67 


40,91 


38,85 


99-25922/147 


G/T 


44,19 


39,58 


50 


40,94 


99-16081/217 


C/T 


42,28 


38,82 


46,67 


16 74 


99-16082/218 


*A/G 


34,73 


31,94 


38,14 


33,81 


99-24656/260 


A/G 


48,87 


49,32 


48,31 




99-24639/163 


G/T 


38,52 


33,33 


45 




99-24634/108 


A/T 


44,85 


42,67 


47,54 


50 


99-7652/162 


A/G 


45,29 


44,08 


46,77 


50,36 


99-16100/147 


A/G 


44,66 


42,75 


46,77 


48,89 


99-5862/167 


C/T 


43,53 


41,45 


46,03 


49,29 


99-5919/215 


A/G 


69,42 


71,05 


67,46 


60,28 


99-24658/410 


C/T 


64,13 


69,08 


58,06 


61,07 


99-24644/194 


A/G 


39,42 


41,22 


37 3 


40,51 


99-5897/143 


A/C 


57,61 


60,67 


53,97 


61,07 


99-24649/186 


C/T 


67,75 


67,33 


68,25 


62,95 


99-15870/400 


A/G 


33,46 


36,67 


29,51 


30,29 


99-16038/118 


A/G 


34,53 


36,18 


32,54 


43,62 
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99-15880/162 


A/G 


65,11 


63,16 


67,46 


56,43 


99-25940/182 


A/G 


59,42 


56,67 


62J 


52,59 


99-16032/292 


A/C 


64,03 


61,84 


66,67 


55,67 


99-16033/244 


C/T 


54,51 


56,76 


51,69 


56,44 


99-15875/165 


C/T 


56,88 


57,89 


55,65 


66,3 


99-16047/115 


C/T 


71,69 


74,67 


68,03 


75,19 


99-25993/367 


A/G 


44,53 


40,79 


49,18 


40,51 


99-25989/398 


A/G 


32,81 


33,33 


32,2 


27,86 


99-25979/93 


A/G 


68,12 


69,08 


66,94 


69 19 


99-25969/200 


G/T 


36,67 


38,67 


34,17 


18 ftS 


99-25966/241 


A/G 


66,3 


67,1 1 


65,32 


61 2\ 


99-25961/376 


A/C 


39,63 


42,57 


36,07 


17 11 


99-25965/399 


A/G 


50,36 


51,97 


48 39 




99-25977/311 


A/G 


72,01 


67,76 


77,59 


71 79 


99-25950/121 


C/G 


31,75 


36 


26 61 


97 S4 


99-25974/143 


A/G 


25,55 


28,29 




99 7 


99-26150/276 


A/G 


46,54 


51,43 


40,83 


47 76 


99-15258/337 


G/T 


25,55 


26,97 


23,77 


94 1 


99-15261/202 


A/G 


63,06 


59,46 


67 5 


6S 1 S 

O J, 1 J 


99-15256/392 


C/T 


64,96 


61,33 


69,35 


6^ 1 


99-15056/99 


C/T 


32,72 


36,49 


28,23 


31,11 


99-15280/432 


C/T 


42,28 


44 


40,16 


38,97 


99-15355/150 


C/T 


72,3 


70,39 


74,6 


68,79 


99-15253/382 


C/T 


63,04 


62,67 


63,49 


62,95 


99-5873/159 


C/T 


78,1 


79,05 


76,98 


77,34 



Haplotype frequency analysis 

Analysis of markers Haplotype analysis for association of chromosome 13q31-q33- 
related biallelic markers and schizophrenia was performed by estimating the frequencies of all 
5 possible 2, 3 and 4 marker haplotypes in the affected and control populations described above. 

Haplotype estimations were performed by applying the Expectation-Maximization (EM) 
algorithm (Excoffier and Slatkin, 1995), using the EM-HAPLO program (Hawley et al., 1994) 
as described above. Estimated haplotype frequencies in the affected and control population 
were compared by means of a chi-square statistical test (one degree of freedom). 
10 Haplotype association results in schizophrenia cases 

Haplotype studies yielded significant results indicating an association of the nucleotide 
sequences of the invention with schizophrenia. Significant results are shown in Figures 5 and 6, 
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including descriptions of the frequency of the haplotype leading to the maximum chi square test 
(reference no. (1) in figures), the test of the frequency of a particular haplotype in cases vs in 
controls (reference no. (2) in figures) and the p- value assuming that the test has a chi-square 
distribution with 1 degree of freedom (ddl) (reference no. (3) in figures). The results of the 
haplotype analysis using 28 preferred biallelic markers of the invention, 99-24656-260 (A5), 
99-24639-163 (A6), 99-24634-108 (A7), 99-7652-162 (A8), 99-16100-147 (A9), 99-5862-167 
(A10), 99-5919-215 (All), 99-24658-410 (A 12), 99-24644-194 (A19), 99-5897-143 (A47), 99- 
24649-186 (A48), 99-16038-1 18 (A56), 99-15880-162 (A58), 99-25940-182 (A64), 99-16032- 
292 (A9), 99-16033-244 (A67), 99-15875-165 (A68), 99-16047-1 15 (A70), 99-25950-121 
(A105), 99-25961-376 (A106), 99-25965-399 (A 103), 99-25966-241 (A108), 99-25969-200 
(Al 10), 99-25974-143 (Al 12), 99-25977-31 1 (Al 13), 99-25979-93 (Al 14), 99-25989-398 
(A 1 16), and 99-26150-276 (Al 18) are shown in Figures 5 and 6. Figures 5 and 6 also show the 
physical order of the biallelic markers comprising the haplotypes. 

Figure 5 shows the results of the haplotype analysis using the following biallelic 
markers located on the approximately 3 1 9kb sequence of SEQ ID No. 1 : 99-24656-260 (A5), 
99-24639-163 (A6), 99-24634-108 (A7), 99-7652-162 (A8), 99-16100-147 (A9), 99-5862-167 
(AI0), 99-5919-215 (A 1 1), 99-24658-410 (A12), 99-24644-194 (A19), 99-5897-143 (A47), 99- 
24649-186 (A48), 99-16038-1 18 (A56), 99-15880-162 (A58), 99-25940-182 (A64), 99-16032- 
292 (A65), 99-16033-244 (A67), 99-15875-165 (A68), and 99-16047-1 15 (A70). 

Figure 6 shows the results of the haplotype analysis using the following biallelic 
markers located on the approximately 3 19kb of SEQ ID No. las well as additional biallelic 
markers located on the human chromosome 13q3 l-q33 locus: 199-16038-1 18 (A56), 99-1 5880- 
162 (A58), 99-25940-182 (A64), 99-16032-292 (A65), 99-16033-244 (A67), 99-15875-165 
(A68), 99-16047-1 15 (A70), 99-25950-121 (A 105), 99-25961-376 (A106), 99-25965-399 
(A102), 99-25966-241 (A108), 99-25969-200 (Al 10), 99-25974-143 (Al 12), 99-25977-31 1 
(Al 13), 99-25979-93 (Al 14), 99-25989-398 (Al 16), and 99-26150-276 (Al 19). 

A number of biallelic marker haplotypes were shown to be significantly associated with 
schizophrenia. 

Several preferred haplotype all showing highly significant association with 
schizophrenia and including various 2-, 3- and 4- marker haplotypes are haplotypes 817, 818 
and 819, 137, 138, 1 and 2 of Figure 6, and haplotypes 970, 154 and 1 of Figure 5. The p- 
values, odd-ratios and estimated haplotype frequencies are further described in Figures 5 and 6. 
In particular, the two marker haplotype 1 of Figure 5 consisting of biallelic markers 99-5862- 

167 (A10) and 99-15875-165 (A68) showed a highly significant p-value of 7.8xl0~ 5 and an 
odd-ratio of 1.61. Haplotype 818 of Figure 6 consisting of four biallelic markers (99-16032-292 
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(A65), 99-25969-200 (Al 10), 99-25977-31 1 (A 1 13), and 99-25989-398 (AI 16)) presented a p- 

value of 3.1xl0~ 7 and an odd-ratio of 9.08. Another example showing significance is haplotype 
817 of Figure 6 consisting of four biallelic markers (99-16033-244 (A67), 99-15875-165 (A68), 

99-25950-121 (AI05) and 99-25979-93 (Al 14)), presented a p-value of 2.4xl0" 7 and an odd- 
ratio of 100. Phenotypic permutation tests confirmed the statistical significance of these results. 
Estimated haplotype frequencies were 1 0.5% in cases and 0 % in the controls. Haplotype 970 of 
Figure 5 consisting of four biallelic markers (99-5919-215 (Al 1), 99-24658-410 (A 12), 99- 

15875-165 (A68), and 99-16047-1 15 (A 10)) presented a p-value of 7.8x1 0" 7 and an odd-ratio 
of 2.41 . Phenotypic permutation tests confirmed the statistical significance of these results. 
Estimated haplotype frequencies were 25.7% in cases and 12.5 % in the controls. 

Several other significant haplotypes are listed in Figures 5 and 6, including several 2-, 
3- and 4-marker haplotypes. Considered to be highly significantly associated with 
schizophrenia are the most significant 2-marker haplotypes (for example haplotype 1 of Figure 
5 noted above and the most significant 3-marker haplotypes (for example haplotype 137 of 
Figure 6 consisting of biallelic markers (99-1 5875-165 (A68), 99-16047-1 15 (A70) and 99- 
25950-121 (A105)). 

Further preferrred significant haplotypes considered associated with schizophrenia are 
haplotypes having p-values above a desired threshold level; all the haplotypes listed in Figures 5 
and 6 present p-values below l.OxlO" 2 for 2-marker haplotypes, l.OxlO" 4 for 3-marker 
haplotypes, and 1 .OxlO* 5 for 4-marker haplotypes. Figures 5 and 6 show several 2-marker 
haplotypes, haplotypes 1 to 9 and haplotypes 1 to 5 of Figures 5 and 6 respectively, having p- 
values ranging from 7.8xl0* 5 to 8.6xlO'\ several 3-marker haplotypes, haplotypes 154 to 163 
and 137 to 141 of Figures 5 and 6 respectively, having p-values ranging from 3.9X10 -6 to 
l.lxl 0* 4 and several 4-marker haplotypes, haplotypes 970 to 973 and 817 to 836 of Figures 5 
and 6 respectively, having p-values ranging from 2.4x1 0* 7 to 7.3x1 0" 6 . 

Additionally, a particularly large number of the significant 2-, 3- and 4-marker 
haplotypes often comprised the biallelic markers A65, A 12, A67, A70, A 1 07, A 1 10, A 1 16 and 
most commonly A68 (99-15875-165), allele T. 

The statistical significance of the results obtained for the haplotype analysis was 
evaluated by a phenotypic permutation test reiterated 100 times on a computer. For this 
computer simulation, data from the affected and control individuals were pooled and randomly 
allocated to two groups which contained the same number of individuals as the case-control 
populations used to produce the data summarized in figures 5 and 6. A haplotype analysis was 
then run on these artificial groups for the markers included in the haplotypes showing strong 
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association with schizophrenia. This experiment was reiterated 100 times and the results are 
shown in the columns of Figures 5 and 6 labelled "Haplotype test by permutation procedure". 
For a given haplotype, these results demonstrate the number of obtained (simulated) haplotypes 
having a p- value comparable to the one obtained for the given haplotype among 100 iterations. 
These results, set forth in Figures 5 and 6 validate the statistical significance of the association 
between the haplotypes and schizophrenia . 

Example 5c 

Association Study Between Schizophrenia and the Biallelic Markers of the Invention in 

French Canadian Samples 
Collection Of DNA Samples From Affected And Non-AfTected Individuals 

Biallelic markers of the present invention were farther genotyped in French Canadian 
samples as described above in order to compare the association of the 1st and the 2nd portion of 
Region D with schizophrenia. The population used in the study was the same as described 
above with the exception that 2 male FH+ cases were not included. 

The biallelic markers analyzed in the study include 34 preferred biallelic markers of the 
invention located in Region D of the chromosome 13q31-33 region. Included in the analysis 
were the 14 following biallelic markers from the first of two portions of Region D: 99- 
261 50/276 (Al 18), 99-261 56/290 (A120), 99-26153/44 (A 1 1 9), 99-25985/194 (Al 1 5), 99- 
25974/143 (Al 12), 99-25977/3 1 1 (Al 14), 99-25972/317 (A 1 1 1), 99-25965/399 (A107), 99- 
25961/376 (A106), 99-25966/241 (A108), 25967/57 (AI09), 99-25969/200 (Al 10), 99- 
25979/93 (Al 14) and 99-25989/398 (Al 16). Included in the analysis were also the 20 
following biallelic markers from the second of two portions of Region D: 99-25993/367 (A71), 
99-16047/1 15 (A70), 99-15875/165 (A68), 99-16033/244 (A67), 99-16032/292 (A65), 99- 
25940/182 (A64), 99-15880/162 (A58), 99-16038/1 18 (A56), 99-15870/400 (A53), 99- 
24649/186 (A48), 99-5897/143 (A47), 99-24644/194 (A19), 99-24658/410 (A12), 99-5919/215 
(All), 99-5862/167 (A10), 99-16100/147 (A9), 99-7652/162 (A8), 99-24634/108 (A7), 99- 
24639/163 (A6) and 99-24656/260 (A5). 

Single biallelic marker association results in schizophrenia cases 
Single biallelic marker studies yielded significant results, indicating an association of 
the nucleotide sequences of the invention with schizophrenia. Biallelic markers used in the 
analysis included the set of 34 biallelic markers shown in Table 1 1 below, 14 biallelic markers 
of which were located on the first of two portions of Region D, and 20 of which were located on 
the second portion. The distribution of markers in shown in Table 12 below. As summarized in 
Table 13 , analyses using these biallelic markers demonstrated a significant association with 
schizophrenia for 5 markers on the second portion of Region D. 
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Table 11 



REGION 



D 



CONTIG SNPS GENOTYPED I POLYMORPHISM 



1" portion 



2 nd portion 



FREQUENCY IN 
CONTROLS 



99-26150/276 


A/G 


50 


99-26156/290 


A/C 


69 


99-26153/44 


A/C 


61 


99-25985/194 


C/T 


29 


99-25974/143 


A/G 


25 


99-25977/31 1 


A/G 


73 


99-25972/317 


C/T 


72 


99-25965/399 


A/G 


49 


99-25961/376 


A/C 


40 


99-25966/241 


A/G 


63 


99-25967/57 


A/G 


43 


99-25969/200 


G/T 


40 


99-25979/93 


A/G 


72 


99-25989/398 


A/G 


29 


99-25993/367 


A/G 


44 


99-16047/115 


C/T 


73 


99-15875/165 


C/T 


63 


99-16033/244 


C/T 


54 


99-16032/292 


A/C 


58 


99-25940/182 


A/G 


53 


99-15880/162 


A/G 


58 


99-16038/118 


A/G 


42 


99-15870/400 


A/G 


33 


99-24649/186 


C/T 


65 


99-5897/143 


A/C 


59 


99-24644/194 


A/G 


39 


99-24658/410 


C/T 


58 


99-5919/215 


A/G 


60 


99-5862/167 


C/T 


51 


99-16100/147 


A/G 


50 


99-7652/162 


A/G 


52 


99-24634/108 


A/T 


53 


99-24639/163 


G/T 


44 


99-24656/260 


A/G 


54 



Table 12 



Region 


No. of Biallelic 
markers (ct) 


Mean frequency 
(a) 


Mean 

inter-marker 
distance (a) 


D I s ' half 


14(14) 


0.34 (0.07) 


7(6.3) 


D2 na half 


20(8) 


0.42 (0.06) 


11(13) 


D 1 st and 2 nd half 


34 (22) 


0.39 (0.07) 


10.3(11) 



5 

Haplotype frequency analysis 
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Analysis of markers Haplotype analysis for association of chromosome 13q31-q33- 
related biallelic markers and schizophrenia was performed by estimating the frequencies of all 
possible 2, 3 and 4 marker haplotypes in the affected and control populations described above. 
Haplotype estimations were performed by applying the Expectation-Maximization (EM) 
5 algorithm (Excoffier and Slatkin, 1995), using the EM-HAPLO program (Hawley et al., 1994) 

as described above. 

Haplotype association results in schizophrenia cases 

Significant results were also obtained in haplotype studies indicating an association of 
the nucleotide sequences of the invention with schizophrenia. 

1 0 The present inventors having previously demonstrated highly significant association of 

biallelic markers located on the Region D subregion of the human chromosome 13q31-q33 
locus with disease. Using the Omnibus LR test which compares the profile of haplotype 
frequencies, and Haplo-maxM test which is based on haplotype differences for each haplotype 
in two groups, Figures 7 and 8 describe the results of an analysis of the first and second portions 

1 5 of Region D which demonstrated an association of the second portion of Region D with 

schizophrenia. 

For combinations of 2 and 3 biallelic markers, one likelihood ratio test is obtained based 
on the haplotype frequency values calculated using the E-M algorithm. A permutation 
procedure was used, where data from the affected and control individuals was pooled and 

20 randomly allocated to two groups which contained the same number of individuals as the case- 

control populations used to produce the data. A haplotype analysis was then run on these 
artificial groups for the markers included in the haplotypes showing strong association with 
schizophrenia. This experiment was reiterated 100 times. For a given haplotype, these results 
demonstrate the number of obtained (simulated) haplotypes having a p-value comparable to the 

25 one obtained for the given haplotype among 1 00 iterations. 

Figure 7 shows a comparison of the LR test value distributions of haplotype 
frequencies in the two portions of Region D. This association of the second portion of Region 
D with schizophrenia is shown using both 2-marker and 3-marker combinations. The 
distribution of LR test values in the different regions was analyzed using a Kruskal-Wallis rank 

30 test, a chi-square test with r-1 degrees of freedom, where r represents the number of value sets 

compared. As shown, the significance of the association is demonstrated by a chi-square value 

(one degree of freedom) of 74.405 and a p-value of less than 1 x 1 0" ' ® for 2 marker 

combinations, and a chi-square value (one degree of freedom) of 228.72 and a p-value of 1x10" 

^ for 3- marker combinations. 



WO 01/40493 PCT/TBOO/01507 

182 

Another association analysis approach based on haplotype frequency differences, 
referred to as the Haplo-maxM test, was conducted using region D biallelic markers. For one 
combination of markers having h haplotypes, h differences of haplotype frequencies can be 
compared via a Pearson chi-square statistic (one degree of freedom). The haplo-max test selects 
the difference showing the maximum positive test value between cases versus controls 
(rejecting test values based on rare haplotype frequencies, i.e, with an estimated number of 
haplotypes inferior to 1 0); for one combination of markers there is therefore one Max-M test 
value. The results of the Haplo-maxM test using Region D biallelic markers are shown in 
Figure 8. 

Figure 8 shows the distribution of haplo-maxM test values obtained for both 2-marker 
and 3-marker combinations in the two portions of Region D, demonstrating an association of 
the second portion of Region D with schizophrenia. The comparison of the distribution of 
Haplo-maxM test values oin the two regions was analyzed using a Kruskal-Wallis rank test, a 
chi-square test with r-1 degrees of freedom, where r represents the number of value sets 
compared. As shown, the significance of the association is demonstrated by a chi-square value 

(one degree of freedom) of 34.839 and a p-value of less than 3.58x1 0" 9 for 2 marker 
combinations, and a chi-square value (one degree of freedom) of 13.773 and a p-value of 
-4 

2.6x 1 0 for 3- marker combinations. 

The results from the haplo-maxM tests further confirms the association shown using the 
Omnibus LR test results. 

Results of association studies discussed above using biallelic markers of the invention 
are further summarized in Table 13 below, showing a significant association of the biallelic 
markers with schizophrenia in both single biallelic marker and haplotype analysis. 



Table 13 





Single-point Analysis 


Multi-point analysis 
(Haplotype-based analysis) 




No. of allelic freq 
differences > 10% 


No. Significant 
allelic tests 


Omnibus LR TEST * 








2-mks 


3-mks 


4-mks 


Region D, 1st 
portion 


0 


0 


0,03 


0,05 


0,06 


Region D, 2nd 
portion 


0 


5 


0,30 


0,30 


0,31 



* percentage of significant tests (5% level of significance) 
Cases (N=2 1 3) / Controls (N=24 1 ) 



Example 5d 

Association Study Between Bipolar Disorder and the Biallelic Markers of the Invention 
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Description of study design 

Biallelic markers of the invention were analyzed in bipolar disorder cases. As in 
examples above, single and multi-point analyses showed a significant association of the markers 
of the invention, of Region D of the chromosome 13q33 locus, and more particularly of a sub- 
region of Region D with bipolar disorder. 

A) Description of the Affected population 

All the samples were collected from a study of bipolar disorder undertaken in a hospital 
located south of Buenos Aires, Argentina, generally representing a population estimated at 
about 400,000 inhabitants. Patients were evaluated by four doctors in 1994 and 1995. The 
study design involved in the ascertainment of cases and their first degree relatives (parents or 
siblings). 5 1 4 individuals were available for the study. This group consisted of 1 58 subjects 
from 51 different families, and 356 independent subjects. 

As a whole, bipolar disorder cases were ascertained according to the diagnosis of 
bipolar disorder established by the DSM-IV (Diagnostic and Statistical Manual, Fourth edition, 
Revised 1994, American Psychiatric Press); 

Available for consideration for each coded case were also age, sex, nationality of 
parents and grand parents, ethnic origin, familial composition, marital state, socio-economic 
level, educational level, professional situation, employment, receational activities, age of onset 
of phychiatric symptoms, age of first consultation, occurrences of obstetric or prenatal 
incidents, suicide attempts, other medical conditions, treatment for or occurrence of a 
neurological condition, familial occurrence of symptoms, previous or concurrent use of 
psychotropic drugs, other admissions to a hospital or medical treatments, and diagnostic reason 
for admission including (a) DSM-IV diagnosis and (b) symptoms first presented on admission 
to hospital. 

Available for study were 226 bipolar disorder ascertained cases of which 203 were 
independent cases. This group consisted of 51 cases from 5 1 families, 20 cases in relatives 
thereof, and 155 independent cases. Upon elimination of 3 cases from the initial independent 
1 55 cases due to discovery of a familial relation, the total number of independent cases was 203. 

Cases were classified according to bipolar disorder type. The cases included 1 1 5 
bipolar disorder type I individuals (including 1 rapid cycling case), 67 bipolar disorder type II 
individuals (including 1 rapid cycling case), 18 unclassified bipolar disorder cases, and 3 cases 
which remained unclassified due to lack of or inconsistent information. 

The 203 independent cases were examined for a familial history of psychosis. 53 of 
these cases reported an occurrence of psychosis (characterized as schizophrenia or bipolar 
disorder) among first degree relatives (father, mother, brothers, sisters or children). 
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B) Demotion of the Unaffected population 

Available for study were 201 controls which had not been affected by any psychiatric 
difficulties or reported any familial history of psychiatric difficulties. Available for 
consideration were also age, sex and ethnic origin of the unaffected population. 

Q Case and Control Populations Selected for the Association Study 

For the association study, the case population under study consisted of 201 individuals 
selected from the 226 total cases above; the control population consisted of 198 individuals 
selected from the 201 controls described above. 

The association data that are presented in the Example 5d below were obtained on a 
population size detailed in Table 14 below. 



Table 14 



Cases and Control Populations Selected for the Association 
Study 






Sample type 


Cases 


Controls 


Sample size 




201 


198 


Gender 










Male 


68 


81 




Female 


124 


117 




Missing 


9 




Ethnic origin 








Causasian 


182 


177 




Non Caucasian 


5 


21 




Missing 


14 




Familial history of psychosis (FH)* 








positive (FH+) 


54 


0 




none (FH-) 


147 


198 


* : close relatives (first degree) 







Both case and control populations form two groups, each group consisting of unrelated 
individuals that do not share a known common ancestor. 

Genotyping of affected and control individuals 

The general strategy was to individually scan the DNA samples from all individuals in 
each of the populations described above in order to establish the allele frequencies of biallelic 
markers, and among them the biallelic markers of the invention, in the diploid genome of the 
tested individuals belonging to each of these populations. 

Allelic frequencies of every biallelic marker in each population (cases and controls) 
were determined by performing microsequencing reactions on amplified fragments obtained by 
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genomic PCR performed on the DNA samples from each individual. Genomic PCR and 
microsequencing were performed as detailed above in Examples 1 to 4 using the described PCR 
and microsequencing primers. 

Association analysis 

5 The association analysis included 30 preferred biallelic markers of the invention located 

in Region D of the chromosome 13q3 1-33 region. Included in the analysis were the 14 
following biallelic markers from the first of two subjective portions of Region D: 99-26150/276 
(Al 18), 99-26156/290 (A120), 99-26153/44 (AI 19), 99-25985/194 (Al 15), 99-25974/143 
(Al 12), 99-25977/3 1 1 (Al 13), 99-25972/3 17 (Al 1 1), 99-25965/399 (A107), 99-25961/376 

10 (A 106), 99-25966/241 (A 108), 25967/57 (A 109), 99-25969/200 (Al 10), 99-25979/93 (Al 14) 

and 99-25989/398 (Al 16). Included in the analysis were also the 16 following biallelic markers 
from the second of two portions of Region D: 99-25993/367 (A71), 99-16047/1 15 (A70), 99- 
15875/165 (A68), 99-16033/244 (A67), 99-16032/292 (A66), 99-25940/182 (A65), 99- 
15880/162 (A58), 99-16038/1 18 (A56), 99-15870/400 (A53), 99-24649/186 (A48), 99- 

15 5897/143 (A47), 99-24644/194 (A19), 99-5919/215 (Al 1), 99-5862/167 (A10), 99-16100/147 

(A9), and 99-7652/162 (A8). 

A) Single biallelic marker association results in bipolar disorder cases 
For each allele of the biallelic markers included in this study, the difference between the 
allelic frequency in the unaffected population and in the population affected by bipolar disorder 

20 was calculated and the absolute value of the difference was determined. The set of biallelic 

markers and their allelic frequencies included in this study are set forth in Table 15. The more 
the difference in allelic frequency for a particular biallelic marker or a particular set of biallelic 
markers, the more probable an association between the genomic region harboring this particular 
biallelic marker or set of biallelic markers and bipolar disorder. Allelic frequencies were also 

25 useful to check that the markers used in the haplotype studies meet the Hardy- Weinberg 

proportions (under random mating assumptions) 



Table 15 



REGION 


CONTIG 


POSITION 
ON CONTIG 


SNPS GENOTYPED 


POLYMORPHISM 


FREQUENCY 
IN CONTROLS 






168,02 


99-26150/276 


A/G 


62,93 






173,29 


99-26156/290 


A/C 


72,42 






177,01 


99-26153/44 


A/C 


52,66 






186,41 


99-25985/194 


C/T 


28,87 






190,15 


99-25974/143 


A/G 


31,79 






216,43 


99-25977/311 


A/G 


63,82 



Region D 
first Half 
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224 62 


Q9-2S07?/^ 17 

77"LJ7 1 £.1 J 1 / 


err 


/ Z,JZ 






236,64 


99-2SQ6S/399 


A/n 


SR 9d 






244,82 


00-25961/176 


A/r 


A A *1C 






254,70 


99-25966/241 


A/G 


66,18 






257,85 


99-25967/57 


A/G 


42,44 






261,23 


99-25969/200 


G/T 


35,76 






263,67 


99-25979/93 


A/G 


67 15 






269,39 
7Q0 n? 


99-25989/398 

00 7^QQT/1A7 

77-ZJ77J/JO / 


A/G 
a in. 


/IT TO 






JUj,U4 


QO 1 £0/17/1 1 ^ 
77-1 0U4 //I 1 J 




oy,oi 






11% A7 


OO 1 ^Q7^/1 

yy- 1 jo / j/ioj 


U/l 


£ 1 1 






KA 91 


77" 1 OU JJ/Z44 


w 1 


50,3 






JOO,J I 


OO 1AnlO/7G7 

yy- iou jz/zyz 


A If 

A/C 


oz,o7 






1£7 1/1 
JO/, 14 


yy-zjy4u/ 1 oz 


A/vj 


54,39 








QQ 1 coon/1 

yy-ijoou/ioz 


A/U 


62,72 








QQ I *Lf\1QI\ 1 O 

yy-ioujo/i i o 


A If 

A/G 


37,29 






JO.?, 4 1 


oo i zmn/Ann 

yy-i Do/u/4uu 


A/U 


2t*,o5 








QQ 7/1£/1Q/1QjC 

yy-z404y/ 1 oo 


L/ 1 


66,57 






jyj,z/ 


OO ^C07/1/l*3 

yyooy // 14j 


a in 
A/c 


3Z,D 






*tU7,7j 


QQ-9d6dd/1 Od 

77"Z t TO t * t tf I 7*f 


A/n 


1R 70 






424,95 


99-5919/215 


A/G 


60,63 






441,62 


99-5862/167 


C/T 


46,53 






444,00 


99-16100/147 


A/G 


48,84 






445,84 


99-7652/162 


A/G 


49,7 


TOTAL 






30 







(1) : frequency (%) in Caucasian controls (N=177) of the first allele (alphabetic order) 
Region D was arbitrarily split in two halves (D 1 st half and D 2 nd half) for purpose of the 
analysis. 

5 The present inventors have previously demonstrated significant association of biallelic 

markers located on the Region D subregion of the human chromosome 1 3q3 1-33 region with 
disease. Using a set of 30 biallelic markers shown in Table 15, D 1 st half contained 14 markers 
and D 2 nd half contained 16 markers. 

Table 1 5 also shows the physical order of the biallelic markers on Region D of the 
10 human chromosome 13q31-q33 region. The mean intermarker distances of the biallelic markers 

on the first and the second subjective portions of Region D were as listed below in Table 16. 

Table 16 
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Mean 

Inter-marker distance (std) 


D 1 51 half 


7.80(6.33) 


D 2 nd half 


9.79 (8.78) 


D 1 st and 2 nd half 


9.58 (8.46) 



The analysis using selected Region D biallelic markers of the invention demonstrated a 
significant association with bipolar disorder for the second portion of Region D. The analysis 
was conducted using the sample population described above with 182 Caucasian cases and 177 
5 Caucasian controls selected from the total case and control group. 

One biallelic marker in particular, 99-1 5875/ 165(A68), located on the second half of 
Region D, demonstrated a significant association with disease at a significance level of better 
than 5% (corresponding to an absolute logarithm (p-value) of 1 .3). 

B) Haplotype association results in bipolar disorder cases 
1 0 Haplotype analysis for association of chromosome 1 3q3 1 -q33-related biallelic markers and 

bipolar disorder was performed by estimating the frequencies of all possible 2, 3 and 4 marker 
haplotypes in the affected and control populations described above. Haplotype frequencies 
estimations were performed by applying the Expectation-Maximization (EM) algorithm (Excoffier 
and Slatkin, 1995), modified by Nicholas Schork. 
15 Significant results were obtained in haplotype studies indicating an association of the 

nucleotide sequences of the invention with bipolar disorder. The haplotype analysis as shown in 
the Figures 9 A, 9B, I OA, 10B, 1 1 A and 1 IB was conducted using the sample population 
described above, using 182 Caucasian cases and 177 Caucasian controls selected from the total 
case and control group. 

20 Using the Omnibus LR test which compares the profile of haplotype frequencies, and 

HapIo-maxM test which is based on haplotype frequencies differences for each haplotype in 
two groups, Figures 9 A, 9B, 10A, 10B, 1 1 A and 1 1 B show the results of a comparison of the 
first and second portions of Region D which demonstrated an association of the second portion 
of Region D with bipolar disorder. 

25 a - Omnibus LR tests values 

For a given combination of 2, 3 or 4 biallelic markers, one likelihood ratio test (LR test) 
is obtained based on the haplotype frequencies values calculated using the E-M algorithm. 

Figures 9A and 9B show a comparison of the LR test value distributions of haplotype 
frequencies in the two portions of Region D. This association of the second portion of Region 
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D with bipolar disorder is shown using both 2-marker and 3-marker combinations. A Kruskall 
Wallis rank test was used to compare LR test values distributions in the two subjective portions 
of Region D. This test has an asymptotic Chi-square distribution, under the null hypothesis of 
no difference between the sets compared, with (r-1) degrees of freeedom, where r represents the 
number of sets compared. Here, we compare the 2 portions of region D, so r=2, and the 
asymptotic Chi-square distribution has 1 degree of freedom. As shown, the significance of the 
association is demonstrated by a chi-square value (one degree of freedom) of 46.62 and a p- 
-12 

value of 8.62x10 for 2 marker combinations, and a chi-square value (one degree of freedom) 

-29 

of 1 24.72 and a p-value of 5.86x1 0 for 3- marker combinations. 

b - Haplo-max tests values 

Another association analysis approach based on haplotype frequencies differences, 
referred to as the Haplo-max test, was conducted using region D biallelic markers. The haplo- 
max test selects the difference showing the maximum positive (maxM) or negative (maxS) test 
value between cases versus controls (rejecting test values based on rare haplotype frequencies, 
i.e, with an estimated number of haplotypes carriers inferior to 10) ; for one combination of 
markers there is therefore one Max-M and one Max-S test values. 

Figures 10A and 10B show the distribution of haplo-maxM test values obtained for both 

2-marker and 3-marker combinations in the two portions of Region D, demonstrating an 

association of the second portion of Region D with bipolar disorder. The comparison of the 

distribution of Haplo-maxM test values in the two regions was analyzed using a Kruskal-Wallis 

rank test, a chi-square test with 1 degree of freedom. As shown, the significance of the 

o 

association is demonstrated by a chi-square value of 29.07 and a p-value 6.98x10 for 2 marker 

combinations, and a chi-square value of 98.63 and a p-value of 3.04x10 for 3- marker 
combinations. 

Figures 1 1A and 1 IB show the distribution of Haplo-maxS test values again obtained 
for all 2-marker and 3-marker combinations in the two portions of Region D, demonstrating an 
association of the second portion of Region D with bipolar disorder. The comparison of the 
distributions of Haplo-maxS test values in the two portions was analyzed using a Kruskal- 
Wallis rank test with one degree of freedom. As shown, the significance of the association is 

o 

demonstrated by a chi-square value of 34.6 and a p-value of 4.05x10 for 2 marker 

combinations, and a chi-square value of 98.31 and a p-value of 3.58x10 for 3- marker 
combinations. 

The results from the haplo-maxM and hapIo-maxS tests thus further confirm the 
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association shown using the Omnibus LR test results. 



Example 5e 

Identification of Common Haplotypes in Three Schizophrenia Populations 

5 

Based on the results above indicating an association of biallelic markers located in 
Region D3 with schizophrenia, further biallelic markers in Region D3 were screened in three 
different populations of schizophrenia cases and controls. The D3 region is an approximately 
150kb genomic region containing the g35030 gene. 

10 Several significant haplotypes comprised of markers in or flanking the g35030 gene and 

common to the three populations were identified. These haplotypes demonstrated significance 
in Omnibus LR and individual haplotype frequency tests. Furthermore, the estimated 
frequencies of the haplotypes were highly similar for each of the different populations. 
The samples used were obtained from a French Canadian population, an Eastern 

1 5 European population, and a United States sample. The French Canadian schizophrenia sample is 

described above in Example 5a, and Tables 7 and 8. The United States schizophrenia sample is 
described in Table 17 below. The Eastern European sample, is described in Table 18 below. 



Table 17 



United States Schizophrenia Cases and Control Populations 
(United States Caucasians) 






Sample type 


Cases 


Random US 
Controls 


Sample size 




131 


188 


Ethnic origin 








European Causasians 
(26 female, 66 male) 


92 






Ashkenazi Caucasians 
(7 female, 1 7 female) 


24 






Other Caucasians 
(7 female, 8 male) 


15 




Familial history of psychosis (FH) 








positive (FH+) 


133 






none (FH-) 


147 


198 



20 
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Table 18 



Eastern European (Rogaev) Schizophrenia Cases and 
Control Populations 






Sample type 


Cases 


Controls 


Sample size 




183 


183 


Ethnic origin: European Causasians 
(97 female, 86 male) 






Familial history of psychosis (FH) 








positive (FH+) 


55 






none (FH-) 


128 





For each of the three populations, multipoint analyses were carried out using biallelic 
markers from the D3 region. The haplotypic frequency differences between case and controls 
was examined, reported as MaxM for the maximum positive difference, and MaxS as the 
maximum negative difference, and the profile of haplotype frequency differences, where the 
distribution of haplotype frequency differences is described using the Omnibus LR test. 

Several significant two-marker haplotypes were found to be in common among the 
three populations. One haplotype, referred to haplotype 2, is described further in Tables 19 and 
20. Other haplotypes are comprised of biallelic markers which are in linkage disequilibrium 
with one of the biallelic markers of haplotype 2, and thus also provide significant results. 
Biallelic markers 99-15875-165 (A68) (allele T) and 99-5862-167, involved in haplotype 2, 
demonstrated a range of estimation of haplotype frequencies of 23.6 to 24.5 in cases, and 15.5 
to 17.6 in controls. Again, similar differences in frequency estimates were seen with other 
haplotypes. Markers 99-1 5875-1 65 (A68) (allele T) and 99-5862-167 are located in or flanking 
the g35030 gene. 

Table 19 shows the results of analysis of estimated haplotype frequencies for haplotype 
2, comprising biallelic markers 99-15875-165 (A68) (allele T) and 99-5862-167 (A10) (allele 
T). Presented are the results of a likelihood ratio test (LR test), demonstrating a significant p- 
value in each of the three populations; the results of the LR test on which a permutation 
procedure has been carried out (Omnibus), demonstrating a significant p-value of 5*10-3 for 
French Canadian samples, 8*10-2 for Eastern European (Rogaev) samples, and 1*10-2 for U.S. 
samples; and the estimated haplotype frequencies in cases and controls, and their positive or 
negative difference (MaxM). Notably, the haplotype frequencies in the three populations are 
highly similar at 23.60, 23.90 and 24.50 for cases and 15.5, 17.6 and 15.5 for controls. 
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Table 19 

HAP2: " 
99-15875/165 (T), 99-5862/167 (T) 



POP 


haplotype 


Pvalue- 
hap 


Omnibus 
(1000 
perm) 


freq 
cas 


freq 
controls 


Difference 


French Canadian 


TT 

LDNormalized:-0.16) 


2.20E- 
03 


5,O0E-03 


23,60 


15,5 


8,10 


axM 


Eastern European 


4.60E- 
02 


8.70E-02 


23,90 


17,6 


6,3 


axM 


U.S. 


8.60E- 
03 


l,30E-02 


24,50 


15,5 


9 


axM 



5 Example 6 

Preparation of Antibody Compositions to the g35030 protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells 
containing an expression vector encoding the g35030 protein or a portion thereof. The 
concentration of protein in the final preparation is adjusted, for example, by concentration on an 
1 0 Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to 

the protein can then be prepared as follows: 

A. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes in the g35030 protein or a portion thereof can be prepared 
from murine hybridomas according to the classical method of Kohler, G. and Milstein, C, Nature 
1 5 256:495 (1 975) or derivative methods thereof. Also see Harlow, E., and D. Lane. 1 988. 

Antibodies A Laboratory Manual. Cold Spring Harbor Laboratory, pp. 53-242. 

Briefly, a mouse is repetitively inoculated with a few micrograms of the g35030 protein or 
a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol 
20 with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on 

selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and 
aliquots of the dilution placed in wells of a microliter plate where growth of the culture is 
continued. Antibody-producing clones are identified by detection of antibody in the supernatant 
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fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, 
(1980), and derivative methods thereof. Selected positive clones can be expanded and their 
monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody 
production are described in Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New 
York. Section 21-2. 

B. Polyclonal Antibody Production bv Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the g35030 
protein or a portion thereof can be prepared by immunizing suitable non-human animal with the 
g35030 protein or a portion thereof, which can be unmodified or modified to enhance 
immunogenicity. A suitable non-human animal is preferably a non-human mammal is selected, 
usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been 
enriched for g35030 concentration can be used to generate antibodies. Such proteins, fragments 
or preparations are introduced into the non-human mammal in the presence of an appropriate 
adjuvant (e.g. aluminum hydroxide, RJBI, etc.) which is known in the art. In addition the 
protein, fragment or preparation can be pretreated with an agent which will increase 
antigenicity, such agents are known in the art and include, for example, methylated bovine 
serum albumin (mBSA), bovine serum albumin (BSA), Hepatitis B surface antigen, and 
keyhole limpet hemocyanin (KLH). Serum from the immunized animal is collected, treated and 
tested according to known procedures. If the serum contains polyclonal antibodies to undesired 
epitopes, the polyclonal antibodies can be purified by immunoaffinity chromatography. 

Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small 
doses (ng level) of antigen administered at multiple intradermal sites appears to be most 
reliable. Techniques for producing and processing polyclonal antisera are known in the art, see 
for example, Mayer and Walker (1987). An effective immunization protocol for rabbits can be 
found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion 
in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, 
O. et al., (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of 
serum (about 1 2 uJvl). Affinity of the antisera for the antigen is determined by preparing 
competitive binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of 
Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., Washington, 
D.C.(I980). 
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Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 
identify the presence of antigen in a biological sample. The antibodies may also be used in 
5 therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein 

in the body. 

The disclosures of all issued patents, published PCT applications, scientific references or 
other publications cited herein are incorporated herein by reference in their entireties. 
10 Although this invention has been described in terms of certain preferred embodiments, 

other embodiments which will be apparent to those of ordinary skill in the art of view of the 
disclosure herein are also within the scope of this invention. Accordingly, the scope of the 
invention is intended to be defined only by reference to the appended claims. 
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Sequence listing free text 

The following free text appears in the accompanying Sequence Listing: 

schizophrenia 

associated 

proteins 

biallelic 

markers 

regulatory 

region 

gene 

exon 

complement 
polymorphic base 
or 

allele 

deletion 

of 

basic 

protease 

cleavage 

site 

probe 

primer 

downstream 

upstream 

amplification 

sequencing 

oligonucleotide 



WO 01/40493 PCT/IB00/01507 

195 

The disclosures of the following references are incorporated herein by reference in their 
entireties: 

Abbondanzo S.J. etal. (1993) Methods in Enzymology, Academic Press, NewYork. pp. 803- 
823. 

AjiokaR.S. etal. (1997) Am. J. Hum. Genet.60: 1439- 1447. 
Altschuletal., 1990, J. Mol. Biol. 215(3):403-410; 
Altschul et al., 1993, Nature Genetics 3:266-272 
AItschuletaI.,1997,Nuc. Acids Res. 25:3389-3402 
Ames, R.S. et al. J. Immunol. Methods 184: ] 77-186(1995) 
Anton M. etal., 1995, J. Virol.,69: 4600-4606. 
ArakiKetal. (1995) Proc. Natl. Acad. Sci. USA. 92(l):I60-4. 
Ashkenazi, A. et al. PNAS 88:10535-10539(1991) 

Ausubel et al. ( 1 989)Current Protocols inMolecular Biology, Green Publishing Associates and 

Wiley Interscience,N.Y. 

Bartunek, P. etal. Cytokine 8(1): 14-20 (1996) 

BaubonisW. (1993) Nucleic Acids Res. 21(9):2025-9. 

Beaucage et al., Tetrahedron Lett 1981,22: 1859-1862 

Better, M. et al. Science 240:1041-1043(1988) 

Bradley A.,(1987) Production and analysis of chimaeric mice. In:E.J. Robertson(Ed.), 
Teratocarcinomas and embryonic stem cells: A practical approach.IRL Press, Oxford, pp. 113. 
Brinkman U. et al. J. Immunol. Methods 182:41-50(1995) 

Brown EL, Belagaje R, Ryan MJ, KhoranaHG, Methods Enzymol 1979;68: 109-1 5 1 

Brutlagetal. Comp. App.Biosci. 6:237-245,1990 

Burton, D.R. et al. Advances in Immunology 57:191-280(1994) 

Carlson, N.G. etal. J. Biol. Chem. 272(1 7): 1 1295-1 1301(1997) 

ChaiH. etal. (1993) Biotechnol. Appl.Biochem. 1 8:259-273. 

Cheeetal. (1996) Science. 274:610-614. 

Chen and Kwok Nucleic Acids Research 25:347-353 1997 

Chen etal.( 1987) Mol. Ceil. Biol. 7:2745-2752. 

Chen et al., 1987, Mol. Cell.Biol., 7: 2745-2752. 

Chen, Z. et al. Cancer Res. 58(16): 3668-3678(1998) 

ChouJ.Y. (1989) Mol. Endocrinol .3:151 1-1514. 

Clark A.G. (1990) Mol. Biol. Evol. 7:111-122. 

Coles etal. Hum. Mol. Genet, 7:791-800, 1998 

ComptonJ. (1991)Nature. 350(63 1 3):91-92. 



WO 01/40493 PCT/IBOO/01507 

196 

Davis et al., Basic Methods in MoIecularBiology, ed., Elsevier Press, NY, 1986 
Dempster etal., (1977) J.R. Stat. Soc, 39B:l-38. 
Deng, B. et al. Blood 92(6): 1981-1988(1998) 

Dent D.S. and Latchman D.S. (1993) TheDNA mobility shift assay. In; Transcription Factors: 

A PracticalApproach (Latchman DS, ed.) Oxford: IRL Press, ppl-26. 

EcknerR. etal. (1991)EMBOJ. 10:3513-3522. 

ExcoffierL. and Slatkin M.( 1995) Mol. Biol. Evol., 12(5): 921-927. 

Feldman and Steg, 1996,Medecine Sciences, synthese, 12:47-55 

Fell, H.P. etal. J. Immunol. 146:2446-2452(1991) 

Flotteetal. (1992) Am. J.Respir. Cell Mol. Biol. 7:349-356. 

Fodor et al. (1991) Science25 1:767-777. 

Fraleyetal. (1979) Proc. Natl. Acad. Sci. USA.76:3348-3352. 

Fried M. and Crothers D.M. (1981) Nucleic Acids Res.9:6505-6525. 

Fuller S. A. et al. (1996) Immunology in CurrentProtocols in Molecular Biology, Ausubel et 

al.Eds, John Wiley & Sonsjnc, USA. 

FurthP.A. etal. (1994) Proc. Natl. Acad. Sci USA.91:9302-9306. 

Garner M.M. andRevzinA. (1981) Nucleic AcidsRes.9:3047-3060. 

GeysenH. Mario etal. 1984. Proc. Natl. Acad.Sci. U.S.A. 81:3998-4002 

Ghosh and Bacchawat (1991) Targeting ofliposomes to hepatocytes, IN: Liver Diseases, 

Targeted diagnosis andtherapy using specific rceptors and ligands. Wu et al.Eds., 

MarcelDekeker, New York, pp. 87-104. 

Gillies, S.D. etal. J. Immunol. Methods 125:191-202(1989) 

Gillies, S.O. et al. PNAS 89:1428-1432(1992) 

GonnetetaL, 1992, Science256: 1443- 1445; 

Gopal (1985) Mol. Cell. Biol., 5:1 188-1 190. 

GossenM. etal. (1992) Proc. Natl. Acad. Sci. USA.89:5547-5551. 

GossenM. etal. (1995) Science. 268:1766-1769. 

Graham et al. (1973) Virology 52:456-457. 

Green etal. (1986)Ann. Rev. Biochem. 55:569-597. 

Greenspan and Bona, FASEB J. 7(5):43 7-444 (1989) 

Griffin et al.(1989) Science.245:967-971. 

Grompe,M. (1993) Nature Genetics. 5:111-117. 

Grompe,M. etal. (1989) Proc. Natl. Acad. Sci. U.S.A.86:5855-5892. 

GuH. etal. (1993) Cell 73:1 155-1 164. 

Guatelli J C etal. Proc. Natl. Acad. Sci. USA. 35:273-286. 



WO 01/40493 PCT/IB00/01507 

197 

Hacia JG, Brody LC, Chee MS, Fodor SP, Collins FS, Nat Genet 1 996; 14(4):44 1-447 
Haff L. A. and Smirnov I. P. (1997) GenomeResearch, 7:378-388. 
Hames B.D. and Higgins S J. (1985) Nucleic Acid Hybridization: A Practical Approach. 
Hames and Higgins Ed., IRLPress, Oxford. 
5 Hammeriing, et al., in: MONOCLONAL ANTIBODIES AND T-CELL HYBRIDOMAS 

563-681 (Elsevier, N.Y., 1981) 

Harju L, Weber T, Alexandrova L, Lukin M, Ranki M,Jalanko A, Clin Chem 1993;39(1 lPt 
l):2282-2287 

Harlandetal.(1985)J. Cell. Biol. 101:1094-1095. 
10 Harrop, J. A. et al. J. Immunol. 161(4): 1786-1794(1998) 

HawIeyM.E. etal. (1994) Am.J. Phys. Anthropol. 18:104. 
Henikoffand Henikoff, 1993, Proteins 17:49-6 1 
Higgins etal., 1996, Methods Enzymol. 266:383-402; 

HillierL. andGreenP. Methods AppL, 1991, 1: 124-8. Gu H. et al.( 1994) Science 265: 103- 
15 106. 

Hoessetal. (1986) Nucleic AcidsRes. 14:2287-2300. 

HuangL. etal. (1996) Cancer Res56(5):l 137-1141. 

Huston et al. Methods in Enzymology 203:46-88(1991) 

Huygen et al. (1996) Nature Medicine.2(8):893-898. 
20 IzantJ.G. and Weintraub H. (1984) Cell36(4): 1007-1 01 5. 

Juianetal. (1992) J. Gen. Virol. 73:3251-3255. 

KanegaeY. eta)., Nucl. AcidsRes. 23:3816-3821. 

Karlin andAltschul, 1990, Proc. Natl. Acad. Sci. USA 87:2267-2268; 

Kettleborough, C.A. et al. Eur. J. Immunol. 24:952-958(1994) 
25 Khoury J.et al. (1 993) Fundamentals of Genetic Epidemiology, Oxford UniversityPress, NY. 

KimU-J. etal. (1996) Genomics 34:213-218. 

Kleinetal. (1987) Nature. 327:70-73. 

Kolleretal. (1992) Annu.Rev. Immunol. 10:705-730. 

Kozal MJ, Shah N, Shen N, Yang R,Fucini R, Merigan TC, Richman DD, Morris D, Hubbell E, 
30 Chee M, GingerasTR, Nat Med 1996;2(7):753-759 

LandegrenU. etal. (1998) GenomeResearch, 8:769-776. 

Lander and Schork, Science, 265, 2037-2048,1994 

LenhardT. etal. (1996) Gene. 169:187-190. 

Liautard, J. et al. Cytokinde 9(4):233-241(1997) 
35 Linton M.F. etal. (1993) J. Clin. Invest. 92:3029-3037. 



WO 01/40493 PCT/IB00/01507 

198 

LiuZ. etal. (1994)Proc. Natl. Acad. Sci. USA. 91:4528-4262. 

Livak et al., Nature Genetics, 9:341-342, 1995 

Livak KJ, Hainer JW, Hum Mutatl994;3(4):379-385 

Lockhart et al.(1996) Nature Biotechnology 14: 1675- 1680. 
5 MansourS.L. etal. (1988) Nature. 336:348-352. 

Marshall R. L. etal. (1994) PCR Methods and Applications.4:80-84. 

McCormicketal. (1994) Genet. Anal. Tech. Appl.l 1:158-164. 

McLaughlin B.A. etal. (1996) Am. J. Hum. Genet.59:561-569. 

Morton N.E. (1955) Am. J. Hum. Genet. 7:277-318. 
1 0 Morrison, Science 229: 1 202 ( 1 985) 

Muller, Y.A. et al. Structure 6(9): 1 153-1 167(1998) 

Mullinax, R.L. et al. BioTechniques 1 2(6) : 864-869( 1 992) 

Muzyczka et al. (1992) Curr. Topics in Micro, and Immunol. 158:97-129. 

Padlan E.A., Molecular Immunology 28(4/5): 489-498(1991) 
15 NadaS. etal. (1993) Cell 73:1 125-1135. 

NagyA. etal. (1993) Proc. Natl. Acad. Sci. USA. 90:8424-8428. 

Naramura, M. et al. Immunol. Lett. 39:91-99(1994) 

Narang SA, Hsiung HM, Brousseau R, Methods Enzymol 1979;68:90-98 

Nedaetal. (1991) J. Biol. Chem. 266:14143-14146. 
20 Newton etal. (1989)Nucleic Acids Res. 17:2503-2516. 

Nickerson D.A. etal. (1990)Proc. Natl. Acad. Sci. U.S.A. 87:8923-8927. 

Nicolau etal.(1982) Biochim. Biophys. Acta. 721:185-190. 

Nissinoff, J. Immunol. 147(8): 2429-2438 (1991). 

Nyren P, Pettersson B,Uhlen M, Anal Biochem 1993;208(1):171-175 
25 O'Reilly et al. (1992)Baculovirus Expression Vectors: A Laboratory Manual. W. H. Freeman 

andCo., New York. 

Ohnoetal. (1994) Science. 265:781-784. 
Oi et al., BioTechniques 4:214 (1986) 

Oritaetal. (1989) Proc. Natl. Acad. Sci. U.S.A.86: 2776-2770. 
30 Ott J.( 1991) Analysis of Human Genetic Linkage. John Hopkins University Press,Baltimore. 

Pastinen et al., Genome Research 1997; 7:606-614 

Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8):2444-2448; 
Pease S. and William R.S. (1990) Exp. Cell. Res. 190:09-211. 
Perlinetal. (1994) Am. J. Hum. Genet. 55:777-787. 
35 Persic, L. etal. Gene 187 9-18(1997) 



WO 01/40493 PCT/IB00/01507 

199 

Peterson etal.(l 993) Proc. Natl. Acad. Sci. USA. 90:7593-7597. 
Pietu et al.(1996) Genome Research.6:492-503. 
Pitard, V. et al. J. Immunol. Methods 205(2): 177- 190(1 997) 
Potter etal. (1984) Proc. Natl.Acad. Sci. U.S.A. 81 (22):71 6 1 -7165. 
5 Prat, M. et al. J. Cell. Sci. 1 1 l(Pt2): 237-247(1998) 

Rayl et al., (1996) J. Bio.Chem, 271, 2225-2233. 
ReidL.H. etal. (1990) Proc. Natl. Acad.Sci. U.S.A. 87:4299-4303. 
Risch,N. and Merikangas, K. (1996)Science. 273:1516-1517. 
Robertson E. (1987) "Embryo-Derived StemCell Lines." In: EJ. Robertson Ed. 
1 0 Teratocarcinomas And Embrionic Stem Cells: A Practical Approach. IRL Press, Oxford, pp. 

71. 

Roguska MA. et al. PNAS 91 :969-973), (1994) 
Rossietal. (1991) Pharmacol. Then 50:245-254. 
Roth J.A. etal. (1996)Nature Medicine. 2(9):985-99l. 
15 Rouxetal. (1989) Proc. Natl. Acad.Sci. U.S.A. 86:9079-9083. 

Ruanoetal. (1990) Proc. Natl. Acad.Sci. U.S.A. 87:6296-6300. 

Sambrook, J., Fritsch, E.F., and T.Maniatis. (1989) Molecular Cloning: A Laboratory Manual. 

2ed. ColdSpring Harbor Laboratory, Cold Spring Harbor, New York. 

Samson M, etal. (1996) Nature, 382(6593):722-725. 
20 Samulskietal. (1989) J.Virol. 63:3822-3828. 

Sanchez-Pescador R. (1988) J. Clin.Microbiol. 26(10):I934-1938. 

Sarkar, G. and Sommer S.S. (1991)Biotechniques. 

SauerB. etal. (1988) Proc. Natl. Acad. Sci.U.S.A. 85:5166-5170. 

Sawai, H. et al. AJRI 34:26-34(1995) 
25 SchaidDJ. etal. (1996) Genet. Epidemiol.l3:423-450. 

SchedlA. etal. (1993a) Nature. 362:258-261. 

Schedl et al. (1993b) Nucleic Acids Res. 21:4783-4787. 

Schedletal. (1993b) Nucleic Acids Res. 21:4783-4787. 

Schenaetal. (1995)Science. 270:467-470. 
30 Schenaetal. (1996) Proc. Natl. Acad. Sci.U.S.A. 93(20): 1061 4-1 0619. 

Schneider et al.(1997) Arlequin: ASoftware For Population Genetics Data Analysis. University 

of Geneva. 

Schwartz and DayhofF, eds., 1978, Matrices for Detecting DistanceRelationships: Atlas of 
Protein Sequence and Structure, Washington :National Biomedical Research Foundation 
35 SczakielG. etal. (1995)Trends Microbiol. 3(6):2 13-21 7. 



WO 01/40493 PCT/IB00/01507 

200 

Shay J. W. etal. (1991) Biochem.Biophys. Acta. 1072:1-7. 

Sheffield, V.C. etal. (1991) Proc.Natl. Acad. Sci. U.S.A. 49:699-706. 

ShizuyaetaL (1992) Proc.Natl. Acad. Sci. U.S.A. 89:8794-8797. 

Shoemaker DD, Lashkari DA,Morris D, Mittmann M, Davis RW, Nat Genet 1996;14(4):450- 
5 456 

Shu, L. etal. PNAS 90:7995-7999(1993) 

Skerra, A. et al. Science 240:1038-1040. (1988) 

Smith(1957) Ann. Hum. Genet. 21:254-276. 

Smith etal. (1983) Mol. Cell.Biol. 3:2156-2165. 
10 Sosnowski R.G. etal. (1997) Proc. Natl. Acad.Sci. U.S.A. 94:1119-1123. 

SpielmannS. andEwensW. J. (1998) Am. J.Hum. Genet. 62:450-458. 

SpielmannS. etal. (1993) Am. J. Hum.Genet. 52:506-516. 

SternbergN.L. (1992) Trends Genet. 8:1-16. 

Sternberg N.L. (1994) Mamm. Genome. 5:397-404. 
15 Studnicka G.M. et al. Protein Engineering 7(6):805-8!4(1994) 

Syvanen AC, ClinChim Acta 1994;226(2):225-236 

Tacsonetal. (1996)NatureMedicine. 2(8):888-892. 

Taryman, R.E. et al. Neuron 14(4):755-762(1995) 

Te Riele et al. (1990) Nature.348:649-651. 
20 Terwilliger J.D. andOttJ. (1994) Handbook of HumanGenetic Linkage. John Hopkins 

University Press, London. 

Thomas K.R.et al. ( 1 986) Cell. 44:4 1 9-428. 

Thomas K.R. etal. (1987)Cell. 51:503-512. 

Thompson et al., 1994, Nucleic Acids Res.22(2):4673-4680; 
25 Tur-Kaspa et al. (1986) Mol. Cell. Biol.6:716-718. 

Tyagietal. (1998) Nature Biotechnology. 16:49-53. 

Urdea M.S. (1988) Nucleic Acids Research. 1 1:4937-4957. 

UrdeaM.S. et al.( 1991) Nucleic Acids Symp. Ser. 24:197-200. 

Van derLugt et al. (1991) Gene. 105:263-267. 
30 Vil, H. et al. PNAS 89:1 1337-1 1341(1992) 

VlasakR. etal. (1983)Eur. J. Biochem. 135:123-126. 

Wabiko et al. (1986)DNA.5(4):305-314. 

Walker etal. (1996) Clin. Chem. 42:9-13. 

Weir, B.S. (1996) Genetic data Analysis II: Methods for Discretepopulation genetic Data, 
35 Sinauer Assoc., Inc., Sunderland, MA, U.S.A. 



WO 01/40493 PCT/IB00/01507 

201 

White, M.B. etal. (1992) Genomics. 12:301-306. 
White, M.B. etal. (1997) Genomics. 12:301-306. 
Wong et al. (1980) Gene. 10:87-94. 

WoodS.A. etal. (1993) Proc. Natl. Acad. Sci. U.S.A.90:4582-4585. 
5 Wu and Wu( 1987) J. Biol. Chem. 262:4429-4432. 

Wu and Wu (1988) Biochemistry. 27:887-892. 
Wuetal. (1989)Proc. Natl. Acad. Sci. U.S.A. 86:2757. 
YagiT. etal. (1990) Proc.Natl. Acad. Sci. U.S.A. 87:9918-9922. 
Yoon, D.Y. et al. J. Immunol. 160(7): 3170-3 179(1998) 
10 Zhao etal. (1998) Am.J. Hum. Genet. 63:225-240. 
Zheng, X.X. et al. J. Immunol. 154:5590-5600(1995) 
Zhu, Z. et al. Cancer Res. 58(15): 3209-3214(1998) 
ZouY. R. etal. (1994) Curr. Biol.4:I099-l 103 



WO 01/40493 
CLAIMS 



202 



PCT/IB00/01507 



1 . An isolated, purified or recombinant polynucleotide comprising a contiguous span of at least 
30 nucleotides selected from the group consisting of: 

(a) SEQ ID Nos. 2 to 1 7, and the complements thereof; 

(b) SEQ ID Nos 24, 25, 28, 74 and 79 to 132, and the complements thereof; and 

(c) at least one of nucleotide positions 199122 to 249803 of SEQ ID No 1 . 

2. An isolated, purified or recombinant polynucleotide according to claim 1, wherein said 
contiguous span of SEQ ID No 1 or the complements thereof comprises at least one of the 
following nucleotide positions of SEQ ID No 1 : 201 123 to 201234, 201 123 to 201560, 214676 
to 2 1 4793, 2 1 5702 to 2 1 5746, 2 1 6836 to 2 1 6994, 2 1 6836 to 2 1 7077, 2 1 767 1 to 2 1 7764, 
227655 to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 
241291, 244353 to 244561 and 246273 to 247802; 

3. An isolated, purified or recombinant polynucleotide according to claim 1, wherein said span 
comprises a biallelic marker selected from the group consisting of A13 to A 18, A20 to A46, 
A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 . 

4. A recombinant vector comprising a polynucleotide according to any one of claims 1 to 3. 

5. A host cell comprising a recombinant vector according to claim 4. 

6. A non-human host animal or mammal comprising a recombinant vector according to claim 
5. 

7. A mammalian host cell comprising a g35030 gene disrupted by homologous recombination 
with a knock out vector, comprising a polynucleotide according to any one of claims 1 to 3. 

8. A non-human host mammal comprising a g35030 gene disrupted by homologous 
recombination with a knock out vector, comprising a polynucleotide according to any one of 
claims 1 to 3. 

9. Use of a polynucleotide comprising a contiguous span of at least 12 nucleotides of a sequence 
selected from the group consisting of SEQ ID Nos. 2 to 17, SEQ ID Nos 24, 25, 28, 74 and 79 
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to 132, at least one of nucleotide positions 199122 to 249803 of SEQ ID No I, and the 
complementary sequences thereto, for determining the identity of the nucleotide at a biallelic 
marker selected from the group consisting of A13 to Al 8, A20 to A46, A49 to A52, A55, A57, 
A59 to A63, A72 to A73, A76 and A123 . 

5 

10. Use of a polynucleotide according to claim 9 in a microsequencing assay, wherein the 3* 
end of said contiguous span is located at the 3' end of said polynucleotide and wherein the 3' 
end of said polynucleotide is located 1 nucleotide upstream of a biallelic marker in said 
sequence. 

10 

1 1 . Use of a polynucleotide according to claim 10 in a hybridization assay, wherein said span 
includes a biallelic marker. 

12. A polynucleotide according to any one of claims 1 to 3 attached to a solid support. 

15 

13. An array of polynucleotides comprising at least one polynucleotide according to claim 12. 

14. An array according to claim 13, wherein said array is addressable. 

20 1 5. A polynucleotide according to any one of claims 1 to 3 or 12 further comprising a label. 

16. A method of genotyping comprising determining the identity of a nucleotide at a biallelic 
marker selected from the group consisting of A13 to Al 8, A20 to A46, A49 to A52, ASS, A57, 
A59 to A63, A72 to A73, A76 and A 123 or the complement thereof in a biological sample. 

25 

17. A method according to claim 16, wherein said biological sample is derived from a single 
subject. 

18. A method according to claim 16, wherein said biological sample is derived from multiple 
30 subjects. 

19. A method according to claim 16, further comprising amplifying a portion of said sequence 
comprising the biallelic marker prior to said determining step. 
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20. A method according to claim 19, wherein said amplifying is performed by PCR. 
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21 . A method according to claim 1 6, wherein said determining is performed by a hybridization 
assay. 

5 22. A method according to claim 16, wherein said determining is performed by a sequencing 

assay. 

23. A method according to claim 16, wherein said determining is performed by a 
microsequencing assay. 

10 

24. A method according to claim 16, wherein said determining is performed by an enzyme- 
based mismatch detection assay. 

25. A method of estimating the frequency of an allele of a biallelic marker in a population 
15 comprising: 

a) genotyping individuals from said population for said biallelic marker according to the method 
of claim 16; and 

b) determining the proportional representation of said biallelic marker in said population. 

20 26. A method of detecting an association between a genotype and a trait, comprising the steps 

of: 

a) determining the frequency of at least one biallelic marker in trait positive population 
according to the method of claim 25; 

b) determining the frequency of at least one biallelic marker in a control population according to 
25 the method of claim 25; and 

c) determining whether a statistically significant association exists between said genotype and 
said trait. 

27. A method of estimating the frequency of a haplotype for a set of biallelic markers in a 
30 population, comprising: 

a) genotyping at least one biallelic marker according to claim 16 for each individual in 
said population; 

b) genotyping a second biallelic marker by determining the identity of the nucleotides at 
said second biallelic marker for both copies of said second biallelic marker present in the 

35 genome of each individual in said population; and 
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c) applying a haplotype determination method to the identities of the nucleotides 
determined in steps a) and b) to obtain an estimate of said frequency. 

28. A method according to claim 27, wherein said haplotype determination method is selected 
5 from the group consisting of asymmetric PCR amplification, double PCR amplification of 

specific alleles, the Clark method, or an expectation maximization algorithm. 

29. A method of detecting an association between a haplotype and a trait, comprising the steps 
of: 

1 0 a) estimating the frequency of at least one haplotype in a trait positive population according to 

the method of claim 27; 

b) estimating the frequency of said haplotype in a control population according to the method of 
claim 27; and 

c) determining whether a statistically significant association exists between said haplotype and 
15 said trait. 

30. A method according to claim 26, wherein said genotyping of step a) is performed on each 
individual of said population. 

20 3 1 . A method according to claim 26, wherein said genotyping is performed on a single pooled 

biological sample derived from said population. 

32. A method of detecting an association between an allele and a phenotype, comprising the 
steps of: 

25 a) determining the frequency of at least one biallelic marker allele in a trait positive population 

according to the method of claim 25; 

b) determining the frequency of said biallelic marker allele in a control population according to 
the method of claim 25; and 

c) determining whether a statistically significant association exists between said allele and said 
30 phenotype. 

33. A method according to claim 26 or 29, wherein said trait is schizophrenia or bipolar 
disorder. 
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34. A method according to claim 26 or 29, wherein said trait is predisposition to schizophrenia 
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or bipolar disorder, an early onset of schizophrenia or bipolar disorder, or a beneficial response 
to or side effects related to treatment against schizophrenia or bipolar disorder. 

35. A method according to claim 39, wherein said phenotype is a symptom of schizophrenia or 
bipolar disorder. 

36. A method according to claim 26 or 29, wherein said control population is a trait negative 
population. 

37. A method according to claim 26 or 29, wherein said case control population is a random 
population. 

38. A method of determining whether an individual is at risk of schizophrenia or bipolar 
disorder, comprising: 

a) genotyping at least one biallelic marker according to the method of claim 1 6; and 

b) correlating the result of step a) with a risk of developing schizophrenia or bipolar disorder. 

39. A diagnostic kit comprising a polynucleotide according to any one of claims 1 to 3 or 12. 

40. A purified or isolated g35030 polypeptide which is encoded by a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 17, and fragments 
or variants thereof. 

41. A purified or isolated g35030 polypeptide comprising at least 6 contiguous amino acid 
residues of any of SEQ ID Nos 1 8 to 23. 

42. A purified or isolated g35030 polypeptide according to claim 41, comprising at least one 
amino acid substitution, addition or deletion. 

43. A method for producing a g35030 polypeptide, wherein said method comprises the 
following steps: 

a) providing a cell host comprising a recombinant vector according to claim 4 containing a 
nucleic acid encoding a g35030 polypeptide; 

b) recovering the g35030 polypeptide produced by said recombinant cell host. 
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44. The method according to claim 43, wherein the recombinant cell host is a recombinant cell 
host according to claim 5. 

45. An isolated or purified antibody composition capable of selectively binding to a polypeptide 
according to claim 40 to 42. 

46. A method for specifically detecting the presence of a g3S030 polypeptide in a biological 
sample, said method comprising the following steps : 

a) bringing into contact the biological sample with an antibody directed against a g35030 
polypeptide according to any one of claims 40 to 42; 

b) detecting the antigen-antibody complex formed between said antibody and said polypeptide. 

47. A diagnostic kit for detecting in vitro the presence of a g35030 polypeptide in a biological 
sample, said kit comprising: 

a) a polyclonal or monoclonal antibody directed against a g35030 polypeptide according to any one 
of claims 40 to 42 or a fragment thereof, optionally labeled; 

b) a reagent allowing the detection of the antigen-antibody complexes formed between said g35030 
polypeptide and an antibody. 

48. A method for the screening of a candidate substance, wherein said method comprises the 
following steps: 

a) providing a polypeptide according to anyone of claims 40 to 42; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; 

d) detecting the complexes formed between said polypeptide and said candidate substance. 

49. The method of claim 48, wherein at step d), the complexes formed are incubated in the 
presence of a polyclonal or a monoclonal antibody according to claim 45 . 

50. A kit for screening a candidate substance interacting with a g35030 polypeptide, wherein 
said kit comprises: 

a) a polypeptide according to anyone of claims 40 to 42; 

b) optionally a monoclonal or a polyclonal antibody according to claim 45. 

5 1 . A method for the screening of a candidate substance, where said method comprises the 
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following steps: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a nucleotide 
sequence encoding a g35030 protein or a variant or a fragment thereof, placed under the control 
of its own promoter; 
5 b) bringing into contact the cultivated cell with a molecule to be tested; 

c) quantifying the expression of the g35030 protein or a variant or a fragment thereof. 

52 A computer readable medium having stored thereon a sequence selected from the group 
consisting of a nucleic acid code comprising one of the following: 
10 a) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 

200, 500, 1000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, wherein 
said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 
1: 201 123 to 247802 and 199122 to 249803; 

b) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
1 5 200, 500, 1 000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 1 7, and the complements 

thereof, to the extent that such a length is consistent with the particular sequence ID; 

c) a contiguous span of at least 8, 12, 15, 18, 20,25, 30,35, 40, 45,50, 55, 60, 65, 70, 75, 
80, 90, 1 00 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, of SEQ ID Nos. 24, 25, 28, 74 and 79 to 132, or the complements thereof; 

20 d) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 

90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
span comprises at least one of the following nucleotide positions of SEQ ID No 1 : 201 123 to 
201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 21 7671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 

25 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 

e) a contiguous span according to a), b), c) or d), wherein said span includes a biallelic 
marker selected from the group consisting of Al to A 127; 

f) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 

30 wherein said span includes a polymorphism selected from the group consisting of A 13 to A18, 

A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 ; and 

g) a nucleotide sequence complementary to any one of the contiguous spans of a), b), c), 

d) , e), f), and g). 

35 53. A computer readable medium having stored thereon a sequence consisting of a polypeptide 



